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estimating  the  phase  differentials  measured  from  lateral 
shearing  interferometry  on  an  optical  wavefront  had  the 
form  of  a  simple  and  appealing  algorithm.  The  need  for  a 
closer  examination  of  Wyant's  technique  from  a  statistical 
point  of  view  was  a  motivating  factor  for  the  work  of  this 
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performance  from  joint  processing  of  multiple  measurements 
performed  by  real-time  wavefront  correction  systems. 
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Abstract 

H  Wavefront  estimation  from  shearing  interferometry 

measurements  is  considered  in  detail.  Two  analyses  are 
presented,  which  involve  the  estimation  of  constant  phase 
from  single  detector  and  detector  array  measurements.  The 
single  detector  analysis  is  carried  out  in  a  discrete  mode 
to  obtain  algorithms  based  on  photon  counting  as  the  alter¬ 
nate  means  for  use  under  low  light  level  conditions.  The 
method  used  follows  the  Maximum  A  Posteriori  and  Maximum 
Likelihood  estimation  theories.  This  is  done  for  measure¬ 
ments  made  in  both  white  Gaussian  noise  and  Poisson  shot 
noise  limited  conditions.  The  results  so  obtained  are 
trigonometric  relationships  between  the  phases  and  the 
photon  counts.  The  theoretical  performance  results  show  a 
strong  signal-to-noise  ratio  dependence.  Simulation  results 
show  that  signal-to-noise  ratios  of  17  dB  or  better  are 
needed  to  produce  adequate  estimates.  Both  theory  and  sim¬ 
ulation  show  that  an  estimate  improvement  is  obtained  as 
more  photon  counts  are  performed,  and  in  the  limiting  case, 
the  ideal  form  is  a  current  measurement.  In  this  sense, 
although  photon  counting  seems  to  be  inferior  to  current 
measuring,  the  error  variance  is  only  1.65  dB  larger  in  the 
worst  case,  where  three  photon  counts  are  performed. 

The  ML  estimator  was  found  to  be  computationally  sim¬ 
pler  than  the  MAP  estimator,  and  with  similar  performance  for 
SNR's  in  the  order  of  10  dB  and  higher. 


xiv 


An  extension  of  the  single  detector  analysis  is  made, 
using  only  the  Gaussian  noise  assumption,  to  derive  an 
algorithm  that  jointly  estimates  the  phase  distribution 
over  an  optical  wavefront.  The  procedure  is  based  on  a 
parametric  dependence  between  the  measurements  performed  by 
adjacent  detectors,  and  on  the  a  priori  knowledge  available 
through  a  covariance  matrix.  An  algorithm  for  processing 
continuous  waveform  measurements  is  developed,  but  no  com¬ 
puter  simulation  is  included  due  to  difficulties  encount¬ 
ered  in  solving  the  feedback  system  equations. 
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PHASE  ESTIMATION  TECHNIQUES  FOR  ACTIVE 


OPTICS  SYSTEMS  USED  IN  REAL-TIME 
WAVEFRONT  RECONSTRUCTION 

I  Introduction 

Reconstruction  of  a  wavefront  in  real  time  is  of  par¬ 
ticular  interest  to  the  Air  Force  because  of  the  need  to 
compensate  for  atmospheric  disturbances  and  target  varia¬ 
tions  that  adversely  affect  laser  weapons  systems.  Wave- 
front  correction  systems  of  diverse  complexity  are  employed 
to  maximize  the  irradiance  of  the  laser  on  a  target.  The 
laser  beam  is  continuously  shaped  in  real  time  by  means  of 
mirrors  to  reconstruct  the  detected  wavefront  of  the  tar¬ 
get's  radiation.  Actually,  the  complex  conjugate  field  is 
reconstructed  to  propagate  back  to  the  target  a  wavefront 
with  the  same  characteristics  but  in  complementary  form. 
Prior  to  such  reconstruction,  the  phase  distribution  of  the 
wavefront  must  be  estimated  over  the  region  of  space 
enclosed  by  the  aperture  of  the  receiving  system.  The  most 
common  method  used  for  measuring  the  phasefront  is  shearing 
interferometry.  The  search  for  improved  phase  estimation 
techniques  using  the  outputs  of  shearing  interferometers 
constitutes  the  basis  of  this  thesis.  The  shearing  inter¬ 
ferometer  will  be  discussed  in  Chapter  II. 

System  Description 

Active  optics  systems  have  been  widely  described  in 
the  literature  and  only  a  brief  description  is  necessary 
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for  the  purpose  of  this  paper.  Detailed  systems  descrip¬ 
tions  are  given  by  Hardy  (Ref  3) ,  Hudgin  (Ref  5) ,  Rimmer 
(Ref  10)  and  Martoni  (Ref  7:1)  among  many.  The  system 
operation  basically  consists  of  wavefront  detection,  phase- 
front  estimation,  and  beam  control.  Figure  1  shows  a  sim¬ 
plified  block  diagram  of  a  typical  system.  In  such  a  sys¬ 
tem,  a  reflecting  telescope  is  used  both  as  entrance  aper¬ 
ture  for  the  optical  radiation  from  the  target,  and  as  exit 
aperture  for  the  laser  beam.  Both  input  and  output  wave- 
fronts  travel  the  same  path  in  opposite  directions.  Part 
of  the  incoming  field  is  deflected  off  onto  a  phasefront 
sensor  usually  composed  of  two  shearing  interferometers. 

The  output  of  this  sensor  is  translated  into  control  com¬ 
mands  which  actuate  deformable  mirrors  off  which  the  laser 
beam  is  reflected  onto  the  target.  Since  this  is  done  in 
real  time,  the  atmosphere  induces  on  the  laser  wavefront 
the  reverse  distortion  effects  induced  on  the  detected 
field.  The  radiation  reaching  the  target  has,  therefore, 
been  adjusted  for  maximum  irradiance. 

Problem  Statement 

There  is  an  issue  expressed  by  the  Weapons  Laboratory 
that  when  the  target  radiance  is  low,  the  detectable  field 
is  not  strong  enough  to  perform  phasefront  estimation  base 
on  con' inuous  signal  measurements.  A  phase  estimation  tech¬ 
nique  was  proposed  by  J.  C.  Wyant  in  1975  (Ref  16:2624), 
based  on  detector  processing  of  photon  counts  observed 
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Fig  1.  A  Typical  Real-Time  Wavefront  Correction  System 
(Adapted  from  Ref  7) 

) 

during  short  time  intervals.  Although  his  assumption  is 
shot  noise  limited  detection,  his  results  are  free  from 
noise  considerations,  and  the  simplicity  of  the  resulting 
algorithm  motivates  one  to  investigate  more  deeply  into  his 
technique.  The  overall  wavefront  is  obtained  from  a  mapping 
of  independent  phase  estimates  over  the  aperture  of  the  sys¬ 
tem. 


The  purpose  of  this  paper  is  to  explore  the  photon 
counting  technique  from  a  statistical  point  of  view  and  to 
examine  in  detail  the  effects  of  receiver  and  signal  shot 
noises  in  order  to  determine  the  extent  to  which  this  pro¬ 
cedure  can  be  applied  efficiently.  Joint  processing  of 
multiple  detectors  has  also  been  considered  within  the 
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scope  of  this  work  in  an  attempt  to  obtain  improved  results 
over  linear  mappings  currently  used  to  estimate  the  wave- 
front  over  the  region  of  interest. 

Approach 

The  basic  approach  to  the  problem  is  based  on  countable 
observables  obtained  by  decomposing  the  continuous  output 
of  a  detector  into  discrete  components.  The  problem 
reduces  to  classical  parameter  estimation  theory  and  will 
be  carried  out  using  the  concepts  of  Maximum  A  Posteriori 
and  Maximum  Likelihood  estimation  theories.  To  fulfill  the 
purpose  of  this  approach,  it  is  assumed  that  specially 
designed  detectors  are  available,  based  on  the  promising 
future  of  charge-coupled  devices  (CCD)  (Ref  13 -.Chapter  12), 
which  are  capable  of  integrating  the  detected  field  signal 
over  short  periods  of  time  and  dumping  the  contents  into 
the  registers  of  a  computer.  Such  a  receiver  can  be  repre¬ 
sented  mathematically  with  a  time  correlator  to  be  described 
later  in  Chapter  III. 

Scope  and  Assumptions 

In  this  thesis,  the  wavefront  process  will  be  con¬ 
sidered  slow  varying  in  time  such  that  a  stepwise  approxi¬ 
mation  to  the  actual  variation  can  be  performed.  The  phase 
in  each  step  of  length  T  will  be  constant  and  the  analysis 
of  the  problem  will  be  limited  to  a  single  observation 
interval  (0,T) .  An  extension  to  sequential  estimation  over 
successive  intervals  can  then  be  performed  using  Gauss- 
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Markov  parameter  models.  That  will  not  be  considered  in 
this  thesis.  Constant  phase  is,  therefore,  the  basic 
assumption  of  the  forthcoming  developments.  The  disturb¬ 
ance  induced  by  the  atmosphere  will  be  modeled  as  an  addi¬ 
tive  noise  phase  to  the  target  wavefront.  There  is  no  need 
to  distinguish  between  the  target  and  noise  induced  phases 
because  the  reverse  disturbance  effects  produced  on  the 
laser  wavefront  by  the  atmosphere  cancel  the  noise  compon¬ 
ent.  Therefore,  they  will  be  lumped  together  into  a  single 
parameter  6,  where  6  is  a  random  variable.  The  probabilis¬ 
tic  descriptions  of  0  will  be  fitted  to  the  ones  given  by 
Gaussian  and  uniform  probability  density  functions  in  the 
interval  (-Tr,TT).  Other  than  for  ease  in  estimator  deriva¬ 
tion,  the  Gaussian  model  is  chosen  considering  that  for 
slow  varying  wavefronts,  the  phase  variations  are  more 
likely  to  be  concentrated  about  the  zero  value  and  less 
likely  as  the  phase  value  increases.  On  the  other  hand, 
because  the  sensor  output  is  a  sinusoidal  variation,  the 
uniform  density  is  also  a  logical  choice  since  the  phases 
are  equally  likely  in  the  interval  (-ir,TT). 

Estimation  of  0  will  be  analyzed  in  the  presence  of 
noise  from  two  points  of  view:  predominant  detector  noise 
and  predominant  signal  shot  noise  processes.  Chapter  IV  is 
devoted  to  the  analysis  of  the  detector  limited  case,  where 
the  noise  is  modeled  as  a  continuous  white  Gaussian  random 
process.  Chapter  V  is  devoted  to  the  analysis  in  signal 
induced  noise,  where  the  noise  is  modeled  as  a  discrete 
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Poisson  count  process.  A  mixed  mode  of  continuous  and  dis¬ 
crete  processes  will  not  be  considered  in  this  paper,  but 
deserves  future  attention.  An  extension  of  the  Gaussian 
noise  analysis  is  performed  in  Chapter  VI  where  the  same 
concepts  are  applied  to  joint  processing  of  two  plane 
detector  arrays.  Finally,  Chapter  VII  makes  a  summary  of 
results  and  conclusions,  and  presents  suggestions  for  fur¬ 
ther  study. 
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II  The  Shearing  Interferometer 


The  wavefront  sensor  of  interest  in  this  thesis  con¬ 
sists  of  two  ac  heterodyne,  lateral  shearing  interferometers. 
This  sensor  configuration  is  depicted  in  Figure  2.  The 
field  entering  the  system  is  beamsplit  into  two  channels. 

Each  channel  has  a  shearing  interferometer  composed  of  two 
confocal  lenses  which  constitute  a  Fourier  transform  pair. 

The  field  at  the  common  focal  point  is  the  Fourier  trans¬ 
form  of  the  received  field.  The  Fourier  transform  field  is 
sampled  with  a  radial  grating  displaced  off  the  optic  axis 
and  rotating  with  velocity  v  and  period  t .  An  expanded 
view  is  shown  in  Figure  3. 

If  the  aperture  is  located  at  the  front  focal  plane  of 
lense  the  field  observed  at  the  back  focal  plane  of 
lense  L2  is  sheared  into  a  number  of  components  laterally 
displaced  from  each  other  by  an  equal  distance  Ms^,  known 
as  the  shear  distance,  where  is  a  parameter  of  the  inter¬ 
ferometer  determined  by  v,  and  M  is  a  magnification  factor 
determined  by  the  ratio  of  the  lenses  in  the  system.  The 
field  components  so  displaced  interfere  with  each  other,  and 
the  modulated  output  is  observed  in  the  form  of  an  inter- 
ferogram  spread  over  a  detector  array.  It  is  this  slowly 
but  continuously  changing  interferogram  that  contains  the 
wavefront  phase  information  being  sought. 
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Rotating  Grating 


Fig  2.  A  Two  Shearing-Interferometer  Sensor  Used  with 

Wavefront  Correction  Systems  (Adapted  from  Ref  3) 

Output  Field 

The  sheared  field  at  the  focal  plane  takes  on  slightly 
different  forms  for  broadband  (white  light)  and  monochro¬ 
matic  fields.  However,  the  same  equation  is  applicable  in 
both  cases  v;hen  the  shear  is  small.  Thus,  for  spatially 
coherent,  white  light  aperture  fields  when  the  shear  dis¬ 
tance  Ms^  is  small,  the  detector  field  intensity  is  given  by 
(Ref  6:58,61) 

+  -  M^[A(r  )]^cosC{r  .)sin(u)t  +  0  (r  )) 

TT  Si  Si  f  ^ 

for  0  <  t  <  T,  (2-1) 
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Fig  3.  Expanded  View  of  the  X-Channel  Interferometer 
(Adapted  from  Ref  6) 


are  phase  functions  in  terms  of  the  phases  at  locations 
from  r^.  The  term  A(r^)  is  the  amplitude  of  the  aperture 
field  at  location  r^  =  Mr^  shown  in  Figure  3,  For  slow 
varying  fields  over  the  aperture  and  constant  intensity, 
Eq.  (2-1)  can  be  simplified  to  (Ref  7:10,6:59) 
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The  frequency 

“  =  ^  (2-5) 

is  the  fundamental  modulation  frequency  of  the  detector 
field  (Ref  6:41).  Double  frequency  and  higher  order  fre¬ 
quency  terms  have  been  dropped  from  Eqs.  (2-1)  and  (2-4) , 
anticipating  subsequent  signal  processor  filtering. 

Detector  Signal 

The  output  r(t)  of  a  detector  at  location  r^  in  the 
back  focal  plane  is  computed  from  the  received  field  intensi¬ 
ty  t'®^^a  t^  ^  detector  noise  as  foMov;s: 

r(t)  =  s(t,6(t))  +  n(t),  at  r^  t  Aperture,  (2-6) 

where  (Ref  2:54-55) 

s(t,e(t))  -  hi-  / 

O 

for  0 (r^  ^)  constant  over  A^,  (2-7) 

is  the  signal  current.  The  constant  n/hf  is  a  detector  con- 

o 

version  factor,  A^  is  the  area  of  the  detector,  and 

t'^^^d  t^  ^  field  of  the  detector  given  by  Eq. 

(2-4)  at  location  r,  =  r  /M.  The  assumption  made  here  is 

1  ^1 

that  the  field  intensity  at  r^  is  constant  over  the  detec¬ 
tor  area.  If  the  observation  time  is  short  enough  such  that 
the  phase  is  approximately  constant  in  the  interval  (0,T)  to 
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fit  the  assumptions  of  Chapter  1,  the  time  dependence  of 
the  phase  can  be  dropped.  The  interval  T  is  assumed  to  be 
much  larger  than  the  period  T  of  the  modulated  field. 
Equation  (2-6)  becomes 

r(t)  =  s(t,6)  +  n(t),  0  <  t  ^  T  (2-8) 


where 


s(t,0)  =  a  +  b  sin(mt  +  6)  (2-9) 

is  the  output  signal  of  one  detector,  and 

^  (2-10) 
o  \  n  / 

„  4  ^  .  (2-11) 

Chapter  III  further  models  the  detector  signal  in  order  to 
pursue  the  photon  count  approach  established  in  Chapter  I. 
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Ill  Phase  Estimation  Preliminaries 


Reconstruction  of  the  wavefront  requires  knowledge  of 
the  phase  distribution  of  the  optical  field  within  the  aper¬ 
ture.  However,  the  knowledge  provided  by  the  shearing 
interferometers  is  in  the  form  of  a  phase  difference  between 
two  points  along  the  line  of  shear  as  given  by  Eq.  (2-3) . 

In  the  simplest  form,  the  problem  is  of  phase  difference 
estimation  from  the  measurement  of  a  single  detector.  The 
phase  difference  is  known  as  the  wavefront  difference  func¬ 
tion  given  by  (Refs  6:49  and  7:16) 

A<|)(r^)  =  ({)  (r^  -  Ms^)  -  4.  (r^  +  Ms^)  (3-1) 

where  <}>  (t^  ~  Ms^)  and  4>  (r^  +  ^®d^  phases  at  the 

aperture  which  we  ultimately  want  to  estimate.  The  compo¬ 
nents  of  the  wavefront  difference  function  in  cartesian 
coordinates  are  given  by 


-  Ms^) 

-  -  "=dx'’'a>' 

X-shear 

(3-2) 

-  ’'a  -  »=dy'' 

Y-shear 

(3-3) 

<}>  (r^  +  Mr  ) 

X-shear 

(3-4) 

Y-shear 

(3-5) 

Figure  4  illustrates  the  phase  distribution  described  by 
Eqs.  (3-2)-(3-5).  The  shear  distances  are  usually  made 
equal  so  that  s^^^  =  s^^  —  s^.  When  M  =  -1  (r^  and  s^  are 


12 


Fig  4.  Phase  Distribution  at  the  Aperture  Seen  by 
Detectors  in  the  Back  Focal  Plane 

also  negative) ,  the  wavefront  difference  function  measured 
at  the  detector  plane  gives  information  of  the  phase  points 
located  one  shear  distance  s^  av/ay  on  each  side  of  the 
detector.  To  estimate  the  actual  phases  the  information 
must  be  collected  from  all  detectors.  This  will  be  addressed 
in  Chapter  VI.  Chapters  IV  and  V  will  concentrate  on  the 
estimation  of  the  phase  difference  0  (r^)  from  one  detector 
only.  Prior  to  considering  the  estimation  problem,  a  signal 
model  must  be  found  using  the  hypothetical  CCD  photon  count¬ 
ing  detector. 


f 
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Signal  Model 

A  signal  model  is  developed  in  this  section  to  estimate 
the  phase  difference  6  measured  by  each  detector  following 
the  integration  approach  with  CCD  devices.  This  is  equiva¬ 
lent  to  saying  that  the  output  of  each  detector  is  sampled 
by  a  correlator  structure  as  shown  in  Figure  5.  In  this 
correlator  model,  the  observation  r(t),  which  represents 
the  cathode  current,  is  sampled  by  a  vector  of  K  orthogonal 
basis  functions 


4»(t)  =  [ijj^(t)  ...  i^j(t)  ...  il^j^(t)]'^,  (3-6) 


(the  superscript  T  means  transpose  of  the  matrix)  where  each 
basis  function  li) ^  (t)  is  given  by 


(t) 


-  ,  t.  <  t  <  t.^, 
q  '  3  -  -  3+1 


=  0  ,  otherwise. 


(3-7) 


with  q  being  the  electron  charge  and  t^  a  sequence  of  )<: 
equal-length,  non-overlapping  time  subintervals  in  one 
observation  interval  of  length  T.  The  resulting  output  is 
a  vector  of  k  observations 

r  =  [r^  ...  ...  rj^]"^  (3-8) 

which  represent  photon  counts  in  the  time  subinterval 
(tj,t^^j^).  With  the  signal  represented  in  this  manner,  the 
following  equalities  are  true: 
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r  . 
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Fig  5.  Equivalent  Correlator  Model  of  the  CCD  Detector 


lij^  (t)  j  (t)  dt 


2  '  ^  ~  3 

q  k 


0  ,  i  ^  j 


(3-9) 


and 


2,  k 

r(t)  =  l.i.m.  3_  i  r  .ij; .  (t)  , 
k  ^  j=l  3  3 


(3-10) 


is  satisfied.  This  condition  known  as  the  Cauchy  criterion 
for  convergence  of  random  sequences  (Ref  8:262)  is  satis¬ 
fied  by  the  choice  of  the  ^j's  as  non-overlapping  time 
subintervals.  Equations  (3-9)  and  (3-10)  are  the  tools 
needed  in  the  forthcoming  parameter  estimation  analysis. 
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This  analysis  will  be  the  classical  discrete  observation 
problem  with  generalizations  to  the  continuous  observation 
form  using  Eq.  (3-10).  For  this  purpose  and  to  define  t^ 
properly,  the  observation  period  is  divided  into  k  equal 
subintervals  (tj,tj_^j^).  The  functions  (t)  are,  therefore, 
identical  except  for  their  sequential  position  in  time.  In 
order  to  simplify  the  derivation  let  the  initial  time  be 
tj^  =  -T/2k  and  the  final  time  t^^+T  be  tj^  =  (2k-l)T/2k  for 
any  interval  T.  Thus,  the  integration  limits  for  the  jth 
subinterval  are 


_  (2j-3)T 

~  2k 


(3-12) 


and 


t . 


j+1 


_  <2j-l)T 

2k 


The  correlation  operation 


r  (t) (t) dt  , 


indicated  in  Figure  5  is  then  given  by 

r(t) 


-^j+1 


dt 


(3-13) 


(3-14) 


f^j+1  rj+l 

^  I  sl^  ^  J  ^ 


3  3 

which  can  be  conveniently  defined  as 


(3-15) 


r.  =  s.(9)  +  n.  , 


(3-16) 
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where 


J 


Sj(0)  = 


0)dt 


(3-17) 


n .  = 

3 


r^j+1 


n (t) dt 


(3-18) 


with  tj  and  given  by  Eqs.  (3-12)  and  (3-13).  Because 

the  observation  r(t)  has  the  form  of  an  electrical  current, 
the  correlation  functions  of  Eq.  (3-7)  were  selected  so  that 
r j ,  s(0),  and  n^  have  the  physical  interpretation  of  photons, 
both  signal  and  noise,  counted  in  each  observation  subinter- 
val  (tj,tj^j^).  The  sum  Z  corresponds  to  the  total 
photons  counted  in  the  observation  interval  (tj^,t^+T).  If 
the  functional  form  of  s(t,0)  given  by  Eq.  (2-9)  is  substi¬ 
tuted  into  Eq.  (3-17) ,  the  signal  in  discrete  form  becomes 


Sj(0)  = 


[a  +  b  sin(a)t  +  0)]dt  ,  (3-19) 


for  0  (t)  constant  over  T. 

The  integration  indicated  by  Eq.  (3-19)  yields 


-  I  -  ‘ji 


+  -  [cos(u)tj  +  0)  -  cos(ujtj^j^  +  0)]  .  (3-20) 


If  the  observation  interval  is  several  times  the  modulation 
period  so  that  many  frequency  cycles  are  observed  each  time. 
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then  they  are  related  by 


N  = 


T 

T 


(3-21) 


where  t  is  as  in  Eq. 
substituted  into  Eq. 


Sj  (0) 


(2-5)  . 
(3-20)  , 

bT 

2TrqN 


With  the  identity  of  Eq.  (3-21) 
the  signal  equation  becomes 


(cos  (^(2  j-3) +0) -cos  (^(2j-l) +0  )  ]  ,  (3-22) 


where  the  definitions  of  Eqs .  (3-12)  and  (3-13)  were  used 

as  integration  limits.  However,  in  order  to  preserve  the 
phase  information,  a  small  shear  distance  is  required  (Refs 
6:41  and  16:2622).  For  small  shear,  the  modulation  fre¬ 
quency  u)  must  be  low  relative  to  the  optical  field  frequency 
such  as  25  kilohertz  (Ref  4:363).  Thus,  typically  x  =  0.04 
milliseconds  and  N  can  range  from  unity  to  a  very  large  num¬ 
ber  depending  on  how  fast  the  phase  changes  in  time.  For 
simplicity  and  to  check  with  Wyant's  results,  let  N  =  1  so 
that  Eq.  (3-22)  becomes 


Sj  (0) 


aT  bT 
^  2iTq 


[cos(J(2j-3)+0)-cos(J(2j-l)+0) ]  .  (3-23) 

Although  Eq.  (3-23)  gives  an  exact  expression  for  the  photon 
counts  in  terms  of  0,  it  is  not  in  a  workable  form.  It  is 
therefore  necessary  to  expand  the  cosine  functions  to  obtain 


18 


^  qf  I  [cos£(2j-3) -cos^{2j-l)  ]  cosB 

- [sin^(2 j-3) -sin^(2 j-1) ] sinoj . 

(3-24) 

Equation  (3-24)  is  the  expression  that  will  be  used  to  rep¬ 
resent  the  signal-generated  photon  counts.  For  ease  in  fur¬ 
ther  derivations,  Eq.  (3-24)  can  be  more  compactly  written 
as 

Sj(e)  =  ^  [a.cosG  -  BjSinG]  ,  (3-25) 

where 

Oj  =  cos  ^  (2j-3)  -  cos  ^  (2j-l)  (3-26) 

=  sin  J  (2j-3)  -  sin  J  (2j-l)  .  (3-27) 

With  the  above  signal  model  completed,  the  estimation 

procedure  to  obtain  0 (r)  will  be  considered  next.  The  "hat" 

over  G  (r)  indicates  that  this  is  just  an  estimate  of  0,  and 

the  argument  "r"  indicates  that  the  estimate  is  in  terms  of 
the  observation  vector  r  of  Eq.  (3-8) ,  The  estimate  G (r) 
will  be  performed  based  on  the  criteria  of  Maximum  A  Pos¬ 
teriori  and  Maximum  Likelihood  estimation.  Before  applying 
these  criteria  to  find  the  phase  estimates  in  Chapters  IV, 

V,  and  VI,  these  concepts  will  be  briefly  e.xplained  in  the 
following  section. 
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Maximum  A  Posteriori  and  Maximum  Likelihood  Estimation 
Theories 

Estimation  of  the  phase  difference  8  measured  by  a 
single  detector  at  a  fixed  location  r^  and  time  t,  and  of 
the  actual  phases  $  measured  over  the  aperture  by  two  detec¬ 
tor  arrays  will  be  made  applying  the  concepts  of  Maximum  A 
Posteriori  (MAP)  and  Maximum  Likelihood  (ML)  estimation 
theories.  The  MAP  and  ML  estimates  of  0  are  those  values 
6 (r)  for  which  the  probability  of  having  found  the  correct  8 
after  the  measurement  is  made  is  maximum.  This  is  equiva¬ 
lent  to  maximizing  the  a  posteriori  probability  density 
function  of  0  conditioned  on  the  observations.  Let  this 
a  posteriori  density  be  represented  by 


'  (3-28) 

and  let  it  be  maximized  by  the  proper  choice  of  9 (r) .  With 
the  condition  that  the  maximum  occurs  within  the  range  of  9 , 
maximization  is  obtained  by  setting 

.  0  .  (3-29) 

An  equivalent  and  sometimes  more  convenient  form  of  Eq. 
(3-29)  is  given  by 


^(ln£o,^(''|rJl  »  0  . 


(3-30) 


By  using  the  Baye ' s  rule  substitution 


j,(0lr)  = 


^rle 


(3-31) 
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In  Eq.  (3-30) ,  the  necessary  but  not  sufficient  condition 
for  the  MAP  estimate  is  found  to  be  (Ref  14:58) 

^[Inf^lQ  (rle)  +  Infg  (0)]  |  =  0  .  (3-32) 

“  MAP 

Because  0  is  in  the  argument  of  a  sine  function,  it  is 
modulo  27r.  The  ML  estimate  of  the  phase  is  then  found  by 
modeling  the  a  priori  density  as  uniform  within  the 

range  of  0,  which  is  the  interval  (-TT,ir).  The  ML  estimate 
is,  therefore,  given  by  (Ref  14:65) 

^[lnfj.|Q  (r|0)]  I  =  0  .  (3-33) 

ML 

Equations  (3-32)  and  (3-33)  are  the  equations  for  MAP  and 
ML  parameter  estimation.  The  parameter  so  found  is  the  one 
with  highest  probability  of  being  the  true  value.  There,  of 
course,  may  be  false  solutions,  and  an  error  is  associated 
with  each  estimate.  The  errors  will  be  treated  in  the  next 
chapters  after  the  solution  algorithms  have  been  found. 

The  problem  of  estimating  0 (r  .)  in  the  presence  of  noise 

Si  f  X. 

will  be  addressed  next;  first,  it  will  be  investigated  in 
the  context  of  a  Gaussian  problem  where  the  only  noise  is 
due  to  thermal  noise  limited  detectors,  and  then  in  the  con¬ 
text  of  a  Poisson  problem  when  the  shot  noise  is  the  predom¬ 
inant  noise  source.  The  approach,  however,  will  be  the  same 
in  both  Gaussian  and  Poisson  cases:  photon  counting  as  the 
only  alternative  under  low  light  level  conditions. 
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IV  Phase  Estimation  in  White  Gaussian  Noise 

When  the  output  field  of  the  interferometer  is  received 
by  a  detector  array  with  predominant  thermal  noise,  the 
noise  is  adequately  modeled  by  a  zero-mean,  stationary 
white  Gaussian  process  described  by  the  probability  density 
(Ref  8:360) 

^n(t)^"^  "  [2Tro2]-l/2  ,  (4-1) 


where  the  variance  is  given  by 


(4-2) 


The  process  has  a  double-sided,  flat  power  spectral  density 
N 

over  and  beyond  the  spectral  region  of  interest.  If  the 
noise  is  described  in  the  form  of  a  random  current,  addi¬ 
tive  to  the  current  output  of  the  detector,  the  spectral 
density  is  given  by  (Ref  8:361) 

N  2kT 

—  (watts  per  hertz  —  per  ohm)  ,  (4-3) 

Z  K 

e 

where  k  is  the  Boltzmann  constant,  is  the  operating  tem¬ 
perature  and  R  is  the  equivalent  resistance  of  the  detec- 
®  N 

tor.  On  a  per-ohm  basis,  has  the  units  of  energy.  The 
correlation  function  of  the  noise  process  is  the  inverse 
Fourier  transform  of  the  power  spectral  density.  For  sta¬ 
tionary  white  noise  it  is  given  by 

N 

E[n(t)n(t')]  =  -Y  «(t-t')  .  (4-4) 
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On  n  per-ohm  basis,  the  correlation  fuiiction  has  the  units 
of  power.  The  first  order  statistic55  aio  jivon  by 

E[n(t)]  =  0  .  (4-5) 


Before  applying  the  c^n.cepts  of  MAP  and  ML  estimation, 
it  is  necessary  to  obtain  the  density  function  of  the  obser¬ 
vation  vector  r  conditioned  on  the  parameter  0,  as  required 
by  Eqs.  (3-32)  and  (3-33).  Therefore,  the  following  devel¬ 
opment  is  made: 

Referring  to  Eqs.  (3-14)  and  (4-6) 


Referring  to  Eqs.  (3-9)  and  (4-4) 


E[nj] 


dt'E[n(t)n(t')]vj  (t)tijj  (f) 


N 

dt’I-^  5(t-t')i);j(t)-J;j(t')] 


(t)dt 


N  T 
o 

2q^k 


(4-7) 
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Theref  Te, 


E[rj]  =  Sj(e)  (4-8) 

NT 

Var[r.]  =  Var[n.]  =  — .  (4-9) 

J  ^  2q  k 

In  addition  to  the  variance  and  the  mean  of  r ^ ,  it  is  neces¬ 
sary  to  demonstrate  that  r^  and  r^  are  uncorrelated.  Thus, 

cov[rj,r^]  =  E[(rj  -  E[rj])(rj  -  Elr^])] 

=  E[  (r  -  s.  (0)  )  (r  -  s  (6)  )  ] 

J  J  Si  4 


=  Eln.ngl 


Further  evaluating  Eq.  (4-10) 


(4-10) 


Eln.n^l 


dt'E|n(t)n(t')  )♦.  (t)*.  (f  )1 
J  'd 


dt'  6  (t-f  )'J^  •  (t)ijj  (f  )  ] 

^  J  q 


T 

Therefore,  from  Eqs, 


cov (r . , r  ] 

1  q 


'i'j  (t)ii;q(t)dt  =  0,  j  5^  q  . 

(3-16),  (4-6),  and  (4-11) 

=  E[n.n  ]  =  0  ,  j  q  , 

J  'd 


(4-11) 


(4-12) 
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s.  (0)  ) 


(4-14) 


The  MAP  estimate  of  a  parameter  0  observed  in  noise  is 
based,  as  indicated  by  Eq.  (3-32),  on  the  conditional  dens¬ 
ity  of  the  observations  r  given  0,  as  well  as  on  the 
a  priori  density  of  0.  Considering  first  the  conditional 
density  of  r  given  0,  the  following  procedure  is  developed; 


From  Eq.  (4-14)  , 


f^|e(£le)  = 


■  2,  “I 

aJi 

vN 

o-J 


k/2 


exp 


2  k 

'q  -  *  =5ie)) 

o  j=l  . 


(4-15) 


The  natural  logarithm  of  Eq.  (4-15)  is 


2  ttN  T  N  T  ~  2r^s  (0)  +  3^(9)) 

— '  o  o  3=1 


2q  k  ^  j 


_  2_Ji  I  ( 0 ) 
o  3=1 


_  a!ii  z  r2  +  Is  In  ^ 

NT  .  ,  2  TTN  T  * 

o  3=1  o 

The  derivative  of  Eq.  (4-16)  with  respect  to  6  is 


(4-16) 


30  ^r 1 


^  £  r  s  ( 0 ) 

NoT  ""j  39 


-  ^  =j<«'  w  • 

o  3=1  -* 

(4-17) 

By  substituting  Eq.  (4-17)  into  Eq.  (3-32) ,  a  general 
expression  for  the  MAP  estimate  is  obtained  and  is  given  by 


2  k 
o  3=1 

(4-18) 

The  solution  of  Eq.  (4-18)  when  the  appropriate  expressions 
for  Sj(0)  and  fQ  (0)  are  substituted  for  yields 


Before  doing  this,  it  is  interesting  to  observe  what  happens 
when  k  becomes  very  large.  It  is  very  simple  to  prove  that 


g(t)  =  l.i.m.  A  S  g.()!.(t) 

k  ^  CO  j  =  i 


=  l.i.m.  A  Z  h.(}).(t)  , 

k  -*•  j=l  ^  ^ 


(4-19) 


where  (ti^(t)  are  orthogonal  functions  such  that 


I 


(t,  =  ± 


(t)  =  0  ,  i  7^  j  , 


(4-20) 


then  l.i.m.  A 

k  -*■  “> 


Z  g.h.  =  / 


g(t)h(t)dt  .  (4-21) 


With  the  help  of  Eqs.  (4-19) -  (4-21)  and  the  orthogonal  rela¬ 
tion  of  Eq.  (3-10)  ,  Eq.  (4-18)  becomes  (tj^->0  as  k-*-“>) 


/  [r(t)  -  s(t,0)]-^  s(t,9)dt  +  -^[InfgO)]  =  0  . 

o  X 


(4-22) 


Equation  (4-22)  is  the  general  expression  for  the  MAP  esti¬ 
mate  0  of  a  parameter  0  in  a  continuous  waveform  r(t) 
observed  in  white  Gaussian  noise  (Ref  14:275). 

Returning  to  Eq.  (4-18),  the  substitution  for  Sj(0) 
given  by  Eq.  (3-25)  is  made  to  obtain  0 (r)  as  follows: 
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From  Eq.  (3-25) 


aqi  Kqi 

s.(0)  =  — j-  +  '  (a.cos0  -  g.sin0] 

3  qk  2TTq  ‘3  3 


(4-23) 


Its  phase  derivative  is 


90  ®3^°^  2Tiq  ‘“3 


[a^sin0  +  BjCOs0] 


(4-24) 


Equation  (4-18)  can  now  be  written  as 


-  (ajSin0  +  3jCos0])| 


+  ^  [InfQ(Q)]  =  0  . 


(4-25) 


In  order  to  simplify  notation,  let  a  new  function  F(0)  be 
defined  as 


F(0)  ^  rinfg  (0))  . 


(4-26) 


After  some  algebraic  manipulation  of  Eq.  (4-25) ,  the  follow¬ 
ing  MAP  estimate  of  0  is  obtained: 


aT, 


1  |tr.  -  ^]a.sin0(r)  +  [r.  -  ^)6.cos0(r); 
3  qk  3  -  3  qk'  3  -  I 


bT 


2.q  jiiiy  2 


.)si 


^)sin20(r)  +  a.B.cos20(r) 


3  3 


ttN 
_ o 

qkb 


F(G) 


0  , 


(4-27) 
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where  a_,  and  £5^  are  defined  by  Eqs.  (3-26)  and  (3-27)  . 
Although  Eq.  (4-27)  defines  is,  it  is  not  in 

final  form.  The  second  summation  in  Eq.  (4-27)  vanishes 
for  all  values  of  k  except  for  Ic  =  2.  This  was  determined 
on  a  computer  cheeJe  and  will  be  used  without  a  rigorous 
proof.  Therefore,  Eq.  (4-27)  becomes 


k 

I 

j=l 


— p] a . sinO (r )  +  E 

qk  3  -  ^.1 


e^cosG  (r) 


ttN 

-  7^  =  O'-  k  >  3  ,  (4-28) 


wh>'re  the  constraint  that  k  >  3  is  imposed  on  the  estimator 
because  for  k  =  1,  the  two  sums  vanish. 

Now,  the  conditions  for  which  the  second  summation  in 
Eq.  (4-27)  vanishes  are  to  be  investigated.  For  that  to 
happen,  it  is  required  that 


k  (a? 
^  _ J, 

j=l 


2 


0 


(4-29) 


k 

and  Z  a. 6.  =  0  .  (4-30) 

j=l  ^  J 

Substitution  of  the  expressions  given  by  Eqs.  (3-26)  and 
(3-27)  for  Uj  and  6^  into  Eqs.  (4-29)  and  (4-30)  yields 

k  tt  1  2 

Z  [cos  ^(2j-3)  -  cos  p(2j-l)]^ 

j  =  l  ^  k 


-[sin  ^(2j-3)  -  sin  J(2j-1)]^}  =  0  ,  (4-31) 
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and 


T  {[cos  ^(2j-3)  -  cos  ^(2j-l)] 

j  =  l  ^  ^ 

[sin  ^(2j-3)  -  sin  i(2j-l) ] }  =  0  .  (4-32) 

By  carrying  out  the  operations  indicated  by  Eqs.  (4-31) 
(4-32) ,  it  is  found  that  the  conditions  for  a  vanishing 
term  are 

Z  (cos  ^(2j-3)  +  cos  ^(2j-l) 

j=l  ^  ^ 

-  2cos  ^(2j-2)}  =  0  (4-33) 

and  Z  {sin  ^{2j-3)  +  sin  ^(2j-l) 

j=l  ^  ^ 

-  2sin  ^(2j-2)}  =  0  .  (4-34) 

The  question  is,  for  what  values  of  )c  do  Eqs.  (4-33)  and 
(4-34)  hold?  The  easiest  way  to  find  out  is  by  computing 
the  summations  for  a  number  of  k's.  This  was  done  on  the 
computer  for  k  =  1  to  k  =  28  with  the  assuring  result  that 
only  for  k  =  2  both  sums  do  not  equal  zero.  It  was  also 
observed  (and  will  be  used  without  a  rigorous  proof)  that 

k  k 

Z  a  =  Z  B.  =  0  (4-35) 

j=l  ^  j=l  3 

for  all  k. 

Therefore,  Eq.  (4-28)  simplifies  even  further  to  yield  the 
final  result  of  the  discrete  MAP  estimate  of  0  as 


30 


f 


k  ^  k 

Z  oi.r.sin6(r)  +  Z  B-r.cosO(r) 


j=l 


D  : 


ttN 


qkb 


j=l 


D  D 


F(0)  =  0  ,  k  >  3  . 


(4-36) 


For  the  particular  case  when  the  phase  6  is  a  Gaussian  ran- 

2 

dom  variable  with  variance  less  than  0.8  rad  ,  the  density 
function  is  practically  given  by 


fg(e)  =  [2TTa^]~^/2exp[-eV2o^]  , 


,2  ,.2, 

e- 


(4-37) 


in  the  interval  (-TT,Tr).  Then, 


F(0)  =  ■^[lnfg(0)]  =  -  • 

^0 


By  letting  0  =  0  (r) ,  Eq.  (4-36)  becomes 


e(r)  = 


qkbCg 
TrN_  ' 


(4-38) 


{-  Z  B.r.cos0(r)  -  Z  a.r.sin0(r)}  .  (4-39) 


j=l 


3  3 


3=1 


3  3 


Equation  (4-39)  is  the  MAP  estimate  of  a  Gaussian  phase  0  in 
terms  of  discrete  observations.  The  MAP  estimate  is, 

No 

therefore,  a  function  of  the  detector  noise  given  by  Eq. 

2 

(4-3) ,  the  variance  of  the  random  phase  and  the  amplitude 
b  of  the  signal.  Equation  (4-39)  is  of  the  form 


X  =  Ucosx  +  Vsinx 


(4-40) 


and  cannot  be  reduced  any  further.  It  can  be  implemented 
in  the  form  of  a  photon  processor  as  shown  in  Figure  6. 
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GAIN 


Fig  6.  Photon  Count  Processor  for  MAP  Estimation  of  0 

Equation  (4-40)  can,  however,  be  solved  numerically  on  the 
computer.  An  effective  way  is  to  expand  the  sine  and 
cosine  functions  into  series  and  solve  the  resulting  poly¬ 
nomial  iteratively  as  a  predictor-corrector  algorithm.  A 
fifth  order  expansion  (three  terms  in  each  function)  results 
in  reasonably  good  solutions  up  to  0.7  radians,  but  it  gives 
gross  errors  for  higher  phase  angles.  On  the  other  hand,  a 
seventh  order  expansion  (four  terms  in  each  function)  proved 
to  give  very  accurate  results  regardless  of  the  phase  value. 
Thus,  using  four  terms  in  the  expansion  of  the  trigonometric 
functions  of  Eq.  (4-40)  (Ref  12:472),  the  following  poly¬ 
nomial  is  obtained: 
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5040U  +  5040{V-l)x  -  2520Ux^  -  840Vx^ 

+  210Ux'^  +  42Vx^  -  7Ux^  -  Vx^  =  0  (4-41) 

Equation  (4-41)  has  seven  roots.  It  was  solved  using  a 
subroutine  from  the  International  Mathematical  and  Statisti¬ 
cal  Library  (IMSL)  package  available  for  use  with  the  CDC 
SGOO/Cyber  74  computer  system  at  the  Air  Force  Institute  of 
Technology.  The  real  roots  so  found  that  satisfy  Eq.  (4-40) 
are  the  estimator  solutions.  It  was  found  from  several 
tests  (no  proof  included)  that  the  lowest  root  always  gives 
the  solution  with  the  smallest  error,  and  is  therefore  the 
sought  MAP  estimate.  As  an  example,  the  solution  to 

X  =  11.23108494624  cosx 

-38.93776905131  sinx  ,  (4-42) 

taken  from  an  actual  simulation  run,  is,  using  single  pre¬ 
cision  , 


X  =  0.2740523040524  radians  (4-43) 

with  a  discrepancy  of 

7.920348821244E-9  radians  ,  (4-44) 

which  is  an  extremely  accurate  result. 

For  the  purpose  of  comparison,  it  is  easily  found 
using  Eq.  (4-22)  that  the  MAP  phase  estimate  from  a  continu¬ 
ous  measurement  is  (Refs  11;189  and  15:129) 
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2ba  ‘ 


6  = 


-( 


r(t)cos  (i)t  dtlcosO 


r(t)sin  tot  dtlsinO^  .  (4-45) 


In  order  to  give  an  idea  of  the  forms  assumed  by  Eq. 
(4-39)  for  a  specified  number  of  observations,  a  few  illus¬ 
trative  examples  are  given  in  Table  I.  Finally,  in  order 
not  to  disrupt  the  continuity  of  the  present  developments, 
verification  of  Eq.  (4-39)  through  computer  simulation  is 
reserved  for  the  last  section  of  this  chapter. 


Maximum  Li)telihood  Estimate 

The  ML  estimate  of  a  parameter  6  observed  in  noise  is 
based  on  the  conditional  density  of  the  observations  given  6 
only.  This  is  performed  when  the  statistical  description  of 
theta  is  not  available  (Ref  14:65).  It  is  reasonable  to 
assume  that  0  is  equally  li)cely  to  occur  in  the  interval 
(-TT,TT)  not  having  any  other  a  priori  information.  The 
density  of  9  is  therefore  modeled  as 

^9  i  '  -71  <  e  <  TT 

=  0  ,  elsewhere  ,  (4-46) 

2 

with  zero  mean  value  and  variance  n  /3.  From  Eq.  (4-46),  it 

2 

is  found  that  F(0)  =  0  (equivalent  to  a  large  in  Eq. 
(4-37)),  and  the  ML  estimate  is  defined  by  Eq.  (3-33). 
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TABLE  I 


Examples  of  the  Maximum  A  Posteriori  Phase 
Estimator  Form  for  k  Observations 


k 

0(r^  . 

..  r,^) 

3 

3/3”  -'3 

-T—  (2rT-rT-r,)k  cos0(r)  -  4(r„-r,)k  sinO (r) 

4  i/iJO  —  z2jo  — 

H 

/2  (r^-r^ ) k^cosO (r ) 

-  /2  (r2-r^ ) k^sinO (r ) 

3/3 

4  ^2(r^-r^)+r2- 

r^-r^+r^lk  cosO  (r) 

J  J  D  O  — 

6 

-r,)k  sinO (r) 

DO  — 

T 

r 

CO 

[ 

/  r (t) cosotdt] k  cosO 

-  [  /  r (t) sinotdt] k  sin9 

• 

'o  *' 

A 

2 

qkbUg 

k 

o 

tiN 

o 

\r 

A 

2bal 

P 

% 
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Equation  (4-36)  then  bccoir.os 


k  k 

Z  a.r.sinO(r)  +  Y.  0.r.cos6(r)  =  0 


j  =  l 

for  k  >  3  . 


(4-47] 


(4-47) 

reduces 

to 

r  ^ 

I  -  I 

3  -r  . 

-1 

j=i 

3  3 

— 

tan 

k 

f 

ML 

a  .  r  . 

L  j=i 

3  3J 

,  k  >  3  . 


(4-48) 


Equation  (4-48)  is  the  ML  estimate  of  the  phase  based  on 
discrete  observations.  The  condition  that  k  >  3  is  main¬ 
tained  since  for  k  =  1,  the  tangent  of  0 (r )  is  not  defined 
(zero  divided  by  zero)  and  for  k  =  2  the  summations  in  Eqs. 
(4-29),  (4-30),  and  (4-35)  do  not  equal  zero.  Again  for 
comparison,  as  k  becomes  very  large  and  with  F(6)  =  0,  Eq. 
(4-45)  gives  (Ref  15:129) 


=  tan 


f  r(t)  cos  ojt  dt 
-1  _0 _ 

J  r(t)  sin  wt  dt 

L  o 


(4-49) 


Equation  (4-49)  is  the  ML  phase  e-:timate  made  from  a  con¬ 
tinuous  measurement. 

Equation  (4-48)  takes  on  different  forms  for  each  value 
of  k.  In  order  to  visualize  how  it  changes,  a  few  illustra¬ 
tive  examples  are  given  in  Table  II.  These  results  agree 
with  the  findings  of  Wyant  (Ref  16:264),  which  inspired  the 
work  of  this  thesis,  although  he  assumed  shot  noise  instead. 
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TABLE  II 


Examples  of  the  Maximum  Likelihood  Phase 
Estimator  Form  for  k  Observations 


1 

tanO  (r,  ...  r,  ) 

1  k 

3 

/3  2ri-r2-r3 

4 

^l"^3 

6 

/3  2(r;^-r4)+r2-r3-r3+r^ 

3  r2+r3-r3-rg 

0.3129  (r^-r^^)  +  0 . 2980  (r2-r^Q-r32+r2Q) 

0. 0967  (r2+rj^Q-r^2-r2Q)  +  0 . 1839  (r3+rg-r33-r^^ 

20 

t0.2531(r3-rg-r^3+r^^)  +  0 . 1839 (r^-rg-r^^+r^g ) 
•••+0.2531  (rg+rg-r3^-r^g)  +  0 . 2976  {r^+r^-r^g-r^.^ )  •" 

...^0.0967(r3-r^-r^3trj^^) 

+  0.3129  (r^-r, .) 

D  iO 

j  r(t)cos  wt  dt 

oo 

C 

/.T 

J  r(t)sin  tot  dt 

0 
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He  originally  noted  that  k  >  3,  and  proposed  four  inte¬ 
grations.  The  result  corresponds  to  the  entry  for  k  =  4  in 
Table  II. 

Before  proceeding  to  the  algorithm  verification,  it  is 
necessary  to  determine  (or  predict)  how  good  an  estimate 
can  be  obtained  from  Eqs.  (4-36)  and  (4-48).  Therefore, 
the  question  of  performance  remains  to  be  addressed. 

The  conditions  for  which  photon  counting  yields  accu¬ 
rate  estimates  are  to  be  investigated  in  the  next  sections. 

Performance  of  the  Estimators 

One  measure  of  performance  of  an  estimator  is  generally 
given  by  the  variance  of  the  estimate  with  respect  to  the 
true  value.  This  measure  of  performance  is  known  as  the 
Mean-Square  Error  (MSE) ,  and  in  the  case  of  unbiased  esti¬ 
mates  (zero-mean  errors) ,  it  is  the  variance  of  the  error 
itself . 

For  nonlinear  estimates,  the  MS  error  is  not  simple  to 
compute,  and  a  lower  bound  on  the  error  is  sought  instead. 
The  most  widely  used  bound  is  the  one  given  by  the  Cramer- 
Rao  inequality  (Ref  14:66-73).  An  approximate  MS  error 
and  the  Cramer-Rao  (lower)  Bound  (CRB)  will  be  computed  in 
this  paper  and  a  comparison  v/ill  be  made  to  establish  the 
validity  of  the  CRB  bound  to  measure  the  performance  of  the 
joint  estimation  problem  addressed  in  Chapter  VI. 

Mean-Square  Error.  The  MSE  of  an  estimate  is  defined 
as  the  expected  value  of  the  square  error  between  the 
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eftimate  6 (r)  and  the  actual  value  of  0  (Refs  14:64,  17:412). 


Let  the  error  be  defined  as 

^  (0  -  0(r))  . 


(4-50) 


The  exact  MSE  is  then  given  by  (Ref  14:56) 


.00  oo 


E[c‘]  = 


J  d0  y*  dr(0  -  0  (r) )2fg^^  (0,r)  .  (4-51) 


—  00  — OO 


The  joint  density  is  given  by  the  product  f  |  q  (£l  ^ )  f  ^  (0 )  . 

For  the  MAP  estimate,  0 (r)  and  the  two  densities  are  given 
by  Eqs.  (4-39),  (4-14),  and  (4-37).  For  the  ML  estimate, 

0 (r )  and  the  densities  are  given  by  Eqs.  (4-48),  (4-14),  and 

(4-46) .  The  direct  computation  of  Eq.  (4-51)  is  difficult 
to  perform.  However,  an  indirect  computation  suggested  by 
Sage  and  Melsa  (Ref  11:189)  is  performed  here  for  discrete 
observations  and  yields  a  result  that  depends  on  the  energy 
of  the  system  for  large  signal-to-noise  ratios.  These  deri¬ 
vations  are  developed  in  the  following  sections. 

The  MAP-MS  Estimation  Error.  The  MAP  estimate  of  6 
when  the  a  priori  density  is  Gaussian  is,  from  Eq.  (4-36), 


0 (r)  =  -  Z  g.r.k  cos0(r)  -  Z  a.r.k  sinO (r)  , 


j  =  l 

where,  from  Table  I, 


J  J  o 


.  A  ^^^0 
^o  irN 


3  =  1 


3  3  o 


(4-52) 


(4-53) 
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If  the  expression  for  r^  given  by  Eq.  (3-16)  is  substituted 
into  Eq.  (4-53),  then  the  MAP  estimate  becomes,  in  terms  of 
signal  and  noise. 


e(r)  =  - 


E  B  •  s .  (0 ) k  cosO  (r) 
j  =  l  3  3  o 


E  B -n .k  COS0 (r) 

j=l  “ 


^  ^ 

E  QjSj (0)k^sin0  (r)  -  E  a ^n^k^sinO (r)  , 

^  (4-54) 

where  Sj(0)  is  given  by  Eq.  (3-25).  After  some  processing 
of  Eq.  (4-54),  the  estimate  becomes 


3  J.  ^  ^ 

^  k  E  {B.cos0(r)  +  a.sin0(r)} 
qk  o  3  -  3  - 


hT  ^  r  '^7 

•y—  k  E  {B^ot.cos0  COS0  (t)  ~  B^sin©  cos0  (r) 
ZTiq  o  j  — J  J  J  ■” 


+  afcosO  sin0  (r)  -  ajD^sinG  sin0 (r) } 


k  E  {B-n.cos0(r)  +  a.n.sin0(r)} 
°  j=l  3  3  33 


(4-55) 


The  equalities  of  Eqs.  (4-30)  and  (4-35)  can  now  be  substi- 

k  k 

tuted  into  Eq.  (4-55).  By  setting  E  a.B.  =  E  a.  = 
k  j=l  ^  ^  j  =  l  ^  ^ 

E  B-  =0  and  expanding  the  products  of  sin0-cos0(r)  and 
3  =  1  ^  . 

cos0-sinO(r)  into  sums,  Eq.  (4-55)  becomes 


^  k  E  {3^[sin(0  +  0 (r)  )  +  sin(e  -  0  (r) ) ] 


4TTq  o  ‘"j 


-  aj[sin(0  +  0  (r )  )  -  sin(0  -  0  (r)  )  ] 


-  k  E  {3.n.cosO(r)  +  a.n.sin0(r)}  .  (4-56) 

o  j  =  l  0  3  3  3 

The  error  defined  in  Eq.  (4-50)  is  expected  to  be  small; 
otherwise,  the  estimate  is  meaningless.  To  inS'  e  this 
condition,  the  approximation 


sin (0  -  0  (r) ) 


(4-57) 


can  be  used  in  Eq.  (4-56).  By  observing  from  Eq.  (4-2?) 

k  2  ^2 

that  E  a.  =  E  6^  =  0,  Eq.  (4-56)  can  be  transformed 

j=l  ^  j=l  ^ 

to  yield  an  expression  for  the  error;  so,  it  becomes 


.  2  ,  .  .  2TTqk 

2  '''k®  bT  ’''k 

V0 


E  {6.n.cos0(r)  -  a.n.sin0(r)}  , 
j=l  1  3  33 


(4-58) 


where  new  parameters  have  been  defined  as 


A  ru  V  2.-1 
Y.  -  [k  E  a.]  , 

^  j=l.  =5 


(4-59) 


which  is  just  a  number  that  depends  on  the  particular  value 


of  k.  and 
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~  ^  <  t  <  t^+T  ,  (4-60) 

which  is  the  ac  component  of  the  energy  in  the  interval 
(ti,ti+T)  of  the  signal  of  Eq.  (2-9). 

The  first  check  to  be  made  on  the  estimate  0 (r)  using 
Eq.  (4-58)  is  that  for  bias  (Ref  17:404)  .  Thus,  by  recall¬ 
ing  from  Eq.  (4-7)  that  Eln^]  =  0,  the  conditional  expecta¬ 
tion  of  the  error  given  0  is 

tt^N 

"  - 1  E[0  (I)  I  0]  .  (4-61) 

^a°0 

Solving  for  E[0(r)|0]  and  noticing  that  E[9|0]  =  0,  Eq. 

(4-61)  gives 

E[6(r)|0]  =  e[l  +  TT^N^Yj^yE^Og]"^  .  (4-62) 

/N 

Thus,  the  conditional  expected  value  of  0 (r)  is  a  scaled 
version  of  the  true  value  of  0.  The  estimate  approaches 
the  true  value  as  the  signal  to  noise  energy  ratio  in 
(ti,ti+T),  E^/N^,  is  made  very  large,  which  is  expected  when 
the  measurement  is  made  in  the  absence  of  noise.  Thus,  in 
the  limiting  case,  and  only  in  that  case, 

lim  E[0  (r) 1 0]  =  0  ,  (4-63) 

E  /N 
a  o 

and  the  MAP  phase  estimate  is  asymptotically  unbiased  (Ref 
17:404) . 
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The  second  computation  to  be  made  from  is  that  of 

the  MS  error  between  0 (r)  and  0.  The  MS  error  is  defined 

2 

as  the  expected  value  of  The  first  order  statistics 

needed  to  asymptotically  characterize  the  MSE  are 


E[e] 

E[0  (r)  ] 
E[e^] 


0 

E[E[0(r)l0]]  =  0 

E[0]  -  E[0  (r) ]  =  0  . 


(4-64) 

(4-65) 

(4-66) 


In  such  a  case,  large  E,/N  ,  the  MS  error  is  both  the  vari- 

a  O 

ance  of  0 (r)  and  the  variance  of  which  will  be  defined 
asymptotically  as 


V  (e  )  lim  V  (e  ) 

a  ^  E  /N  -CO  r 

a  o 


=  lim  E[c^]  . 

E  /N  -0° 
a  o 


(4-67) 

From  Eq.  (4-58)  the  second  order  moment  of  is  given  by 


E[cJ]  =  E((0(r))2] 

Jl  O  d  C 

+  [2irqj^Yj^/bT]  ^ 

E[{  Z  (B.n.cos0(r)  +  a . n . sinO (r ) ) } ^ ]  ,  (4-68) 

j=l  3  1  33 


where  the  identity  Lln^]  =  0  was  used, 
tion  term  in  Eq-  (4-68)  reduces  to 


The  second  expecta- 


N^T  k  , 

-V-  ^ 

2q'^k  j  =  l  J 


f 
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where  use  of  the  identities  of  Eqs.  (4-8)  ,  (4-29)  and  (4-30) 
was  made.  Finally,  by  using  Eqs.  (4-59)  and  (4-60) ,  Eq. 
(4-68)  becomes 


E[0^]  =  [Tr^N^Y,^/E^aQ]^E[(e(r))2] 


(4-69) 


Equation  (4-69)  is  the  MS  error  of  the  phase  estimate.  The 

2 

difficulty  with  this  expression  is  that  E[(0(r))  )  must  be 

evaluated  from  the  density  function  of  Eq.  (4-14)  (Ref  14:64) 
However,  if  the  ratio  E^/N^  is  made  very  large,  the  error 


variance  becomes,  from  Eq.  (4-67) 


n^N  Yi 
o'k 


(4-70) 


It  will  be  beneficial  to  inquire  what  the  effect  of  Yj,  in 
Eq.  (4-70)  is  as  k  varies  from  k  =  3  to  the  limit  where  it 
becomes  very  large.  A  plot  of  Yj^  versus  k  for  large  signal- 
to-noise  ratio  (SNR)  is  shown  in  Figure  7  as  a  normalized 


variance 


''a<‘^r>Ea 


2-1 

A  plot  of  Yi,k;  that  is,  [  S  a.]  ,  versus  k  is  shown  in 

j=l  3 

Figure  8  to  show  how  fast  the  (normalized)  variance,  Yj^f 
approaches  linearity  as  k  increases.  From  Figure  7  it  is 
observed  that  Yj^  tends  to  stabilize  at  around  0.051  as  k 
becomes  large.  This  corresponds  to  the  asymptotic  slope  of 
the  curve  in  Figure  8.  Thus,  equation  (4-70)  becomes 
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Fig  8.  Relationship  between  snd  K  as  a  Product 
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1 . i .m. 

k  ->  00 
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o 


(4-71) 


The  MS  error  of  a  phase  estimate  for  large  signal-to-noise 
energy  ratio  in  (t^,tj^+T)  is  just  a  multiple  of  the  inverse 
SNR  scaled  by  a  factor  that  depends  on  the  number  of  the 
observation  interval  subdivisions,  k.  A  few  examples  to 
illustrate  Eq.  (4-70)  are  given  in  Table  III. 

The  ML-MS  Estimation  Error.  The  same  procedure  used 
in  the  previous  section  to  determine  the  MAP-MS  error  can  be 
used  for  the  ML  case.  Using  Eq.  (4-48)  as  a  starting  point 
and  substituting  for  r^  given  by  Eq.  (3-16) ,  it  is  found  that 
the  ML  estimate  is  unconditionally  unbiased;  that  is, 


E[e(r)je]  =  0 


(4-72) 


This  is  easy  to  verify  by  observing  that  for  ML,  of  =  »  in 

0 

the  MAP  equations.  Thus,  Eq.  (4-69)  reduces  to  Eq.  (4-72) 
without  any  SNR  restrictions.  Completion  of  the  procedure 
yields  an  unconditional  error  variance  given  by 


''ar<  =  r>  = 


tt^N  y, 
o  k 


(4-73) 


Again  this  is  easily  verified  by  letting  o^  =  “  in  Eq. 
(4-69) .  A  different  procedure,  however,  carried  out  by 
Raemer  (Ref  9:263,458)  to  compute  directly  the  Root  Mean- 
Square  (RMS)  error  defined  as 
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TABLE  III 


Examples  of  the  MAP  Phase  Estimator 
Error  Variance  for  Large  SNR 


k 

V  (e  ) 
a '  r' 

3 

.2  N  /2  N  /2 

4tt  o'  ^  o' 

27  E  g 

a  a 

4 

^2  N/2  N  /2 

TT  o  _  ^  2337  ° 

a  a 

5 

T  2  N^/2  N  /2 

o  _  ^  1427  ° 

17.27  E  x.xi../ 

a  a 

6 

2  N  /2  N/2 

a  w 

15 

-„2  N/2  N^/2 

19.45  E  "  1.0147  g 

a  a 

20 

^2  N/2  N/2 

19.57  ?  =  ° 

^  ci 

00 

N/2  N/2 

o  _  o 

E  E 

a  a 
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[V  (c  )] 
ar  r 


1/2 


k  =  “°  / 


(4-74) 


where  0  is  as  in  Eq.  (4-49)  did  show  a  SNR  dependence  of 
the  ML  estimate.  By  expanding  the  arc  tangent  in  Eq.  (4-49), 
Raemer  obtained  similar  results  with  the  condition  of  a 
large  E^/N^  ratio.  Specifically, 


4  p . BnT 
1 


2E. 


(4-75) 


where  and  Bn  were  defined  parameters  of  the  signal  and 
noise  treated  as  narrowband  processes.  Therefore,  the  MAP 
and  ML  error  variance  are  the  same  and  approach  zero  when 
the  signal  is  strong  relative  to  the  receiver  noise.  Thus, 
Table  III  is  equally  an  ML  comparison  of  the  error  variance 
in  terms  of  the  observations  k.  To  show  graphically  the 
relative  improvement  of  entries  in  Table  III,  define  in 
logarithmic  terms 


Imp(dB)  -  10  log 


V^r(^r)  ^^=3 


V  (e  ) 
ar  r 


=  10  log  (Y3/Yj^) 


(4-76) 


The  relative  improvement  in  decibels  (dB)  of  the  error 
with  respect  to  the  error  in  three  subintervals  is  shown 
in  Figure  9. 

The  important  conclusion  of  this  section  is  that  in 
the  presence  of  thermal  (receiver)  noise,  the  error  in  the 
phase  estimate  is  strictly  a  function  of  the  signal-to-noise 
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ratio.  The  ideal  procedure  is  to  perform  a  current  measure¬ 
ment  (k  =  ”) ,  for  which  the  error  is  the  inverse  SNR  multi¬ 
plied  by  a  factor  of  1.  The  procedure  is  suboptimal  if 
discrete  piboton  counts  are  made  (k  <  «>)  ,  for  which  case  the 
same  error  is  scaled  by  an  appropriate  factor  determined  by 
k.  Referring  tj  the  curve  in  Figure  9,  the  continuous  cur¬ 
rent-based  estimate  offers  an  improvement  of  1.65  dB  over 
the  discrete  photon  count  based  on  only  k  =  3  integrations. 
Notice,  however,  that  if  the  hypothetical  CCD  detectors  can 
provide  15  or  more  integrations  over  T,  then  the  improve¬ 
ment  of  a  continuous  measui'cm.ent  over  photon  counting  is 
only  0.064  dB.  Thus,  provided  the  SNR  restrictions  are  met, 
the  photon  counting  algorithms  offer  a  viable  alternative 
for  phase  estimation  when  the  signal  levels  are  low. 

Considering  the  extremes,  when  the  SNR  is  low,  neither 
a  current  measurement  nor  a  photon  count  will  yield  an  accu¬ 
rate  phase  estimate.  On  the  other  hand,  in  the  absence  of 
noise,  the  error  is  zero  (E^/N^  =  as  expected  from  a 
perfect  measurement.  This  is  easily  proven  by  substituting 
Tj  =  Sj(0),  given  by  Eq.  (4-23),  back  into  Eqs.  (4-36) 

(with  F(0)  =  0)  or  (4-48).  With  the  help  of  Eqs.  (4-29), 
(4-30)  ,  and  (4-35)  the  result  obtained  is  0  (r)  =  0  as 
expected.  This  was  also  verified  computationally. 

The  level  of  SNR  to  assure  with  a  given  certainty  that 
an  estimate  is  within  a  specified  phase  range  has  been 
addressed  in  the  literature  and  v'ill  not  be  repeated  hero 
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(Ref  9:266).  Instead,  a  genercil  observation  will  be  made 
from  the  results  of  the  computer  simulation  discussed  at 
the  end  of  this  chapter. 

In  order  to  complete  the  error  analysis,  it  is  desir¬ 
able  to  inquire  what  the  minimum  MS  error  is,  and  the  con¬ 
ditions  under  which  it  is  attained.  A  good  description  is 
given  by  the  Cramer-Rao  bound  on  the  minimum  MSE  although 
other  bounds  are  also  available  (Ref  14:71).  The  Cramer- 
Rao  approach  will  be  considered  in  the  next  section. 

Cramer-Rao  Lower  Bound  on  the  Minimum  MS  Error ,  The 
lower  limit  on  the  value  that  the  minimum  MS  error  (MHSE) 
can  have  can  be  calculated  without  actually  having  to  kno’w 
the  estimate.  For  the  case  of  an  unbiased  estimate,  the 
CRB  bound  is  given  by  (Ref  14:72) 

CRB  =  {E[  (^  Inf^^Q  (r,0)) 

8^  -1 

=  -{E[^  Inf^  „  (r,0)]  }  .  (4-77) 

For  the  case  of  a  biased  non-random  parameter,  a  bound 
expression  is  given  by  Van  Trees  (Ref  14:147). 

In  order  to  use  Eq.  (4-77)  in  the  problem  at  hand,  it 
is  necessary  to  work  with  the  large  SNR  restriction,  for 
which  the  estimate  £' (r)  was  shown  to  be  asymptotically 
unbiased.  Thus,  the  bound  vs  given  by 

V^ic)  ^  CRB  ,  E^/N^  ^  .  (4-78) 

at  a  O 
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Before  letting  the  SNR  become  very  large  let 


Var^^r)  >  CRB  =  lnfj.|g(r]0)  +  lnf^(0)]}  ^ 

(4-79) 

For  the  observation  in  white  Gaussian  noise,  it  is  conven¬ 
ient  to  specialize  Eq.  (4-79)  further  before  considering 
the  phase  problem.  From  Eq.  (4-17) 


36  ^^^rjO 


(r|9)  = 


^  Z  r  ~  s  (0 ) 
NqT  ""j  36 


-  ^  i.  iu  • 

o  3  =  1  ■’ 

(4-80) 

The  second  derivative  of  Eq.  (4-80)  with  respect  to  0  is 


30 


2lnf^|^(rl9)  = 


O  2.  k  ( 

23Ji  I  ’  fr 

NT  I 

o  3  =  1  ( 


-  Sj(0)] 


_3J 

30 


f  >j<i> 


-m' 


(4-81) 


The  CRB  bound  then  becomes 


CRB  = 


E  E 

I  "o’'  j  =  l 

_  /?s.(0)V 

\  30  / 


(r^  -  s.  (0) ) 


+  E 


36 


2  ^3^®^ 


1  »  -1 


30 


2  ^ 


(4-82) 


By  observing  that 
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and 
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(4-85) 


By  use  of  Eq.  (4-85)  in  Eq.  (4-83),  tlhe  bound  becomes 


(2q^k  ■  bT  2 


c™  “  +  B.oosei^  -  E(^  P(e)]|  ^ 


(^)^  I  [u^sin^O  +  B^cos^e  +  a.B-sin2e] 
I  ^o  j=l  ^  3  D  D 


1-^ 


-  E[^  F(0)]j  ,  k  >  3  . 


(4-86: 


Invoking  the  identities  of  Eqs.  (4-29)  and  (4-30)  one  more 
time,  Eq.  (4-86)  can  be  simplified  to 


kb^T 


4it  N^/2  j=l 


Z  -  E[^  F(e)l| 


a  1"^ 

2-^  -  E(^e-  f«>'(  .  X  i  3  , 


for  E  /N  .  (4-87) 

a  o 

Equation  (4-87)  is  the  lower  bound  on  the  MS  error  in  the 
estimate  of  0  given  by  the  Cramer-Rao  inequality.  It  is  a 
function  of  the  ac  component  of  the  signal  energy  E^,  the 

a 

noise  energy  and  the  observation  subinterval  k  repre¬ 
sented  by  Yj^-  It  is  also  a  function  of  the  statistical 
description  of  6  represented  by  the  density  fg(0).  In  the 
particular  case  where  the  phase  is  modeled  as  a  Gaussian 
random  variable  with  the  density  given  by  Eq.  (4-37),  then 
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(4-88) 


EI^  F(0)1  =  -  4  • 

“e 

The  bound  on  the  error  variance  becomes 


V  (e  ) 
a  '  r‘ 


ol 


-1 


2»,  2 
^  Vk°e 


k  >  3 


for  E^/N  -»  “  . 
a  o 


(4-89) 


In  the  limiting  case  as  k  becomes  very  large,  1/Yj^  -  19.6 
(see  p.  44) ,  so 


l.i.m.  V  (e„)  > 

.  a  iT 

k  ->  <» 


2E  , 
a  ,  1 

N  2 

O  OqJ 


-1 


2 

y  o 


N  +  2E  o" 
o  a  0 


for  Eg/N^ 


(4-90) 


Equation  (4-90)  gives  the  MAP  lower  bound  on  the  error  var¬ 
iance  based  on  a  continuous  measurement.  Finally,  when  the 
large  SNR  condition  is  used,  Eq.  (4-90)  becomes 


N 

lim  {l.i.m.  V  (c  )  }  i 

E  /N  ->»  k  -»■  0°  ^  ^  '^^a 

n  o 


(4-91) 


This  is  precisely  the  result  given  by  Eq.  (4-71) .  There¬ 
fore,  as  the  term  E^/N  grows  unbounded,  the  true  error 

a  o 
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variance  approaches  the  CRB  with  equality;  the  MAP  estimates 
0 (r )  and  6  are  asymptotically  efficient  (Ref  14:276).  Thus, 
the  Cramer-Rao  bounds  are  also  given  in  Table  III.  The 
same  observation  is  made  for  the  ML-CRB  on  the  error  vari¬ 
ance.  For  the  ML-CRB,  from  Eq.  (4-46), 

E[^  F(0)  ]  =  0  ,  (4-92) 

and  Eq.  (4-77)  reduces  to  (Ref  14:66) 

-1 

CRB  =  -{E[-^  Inf^,.  (rjo)]}  .  (4-93) 

30 

Therefore  the  bound  for  the  ML  error  is,  from  Eq.  (4-89)  as 
2 

Og  gets  large, 

Va(er)  ^  — g----  ,  k  >  3  ,  (4-94) 

Si 

or  from  Eq.  (4-90)  , 

N 

l.i.m.  V^(e^)  >  2^  .  (4-95) 

k  -*■  “>  ■‘^a 

From  Eq.  (4-90)  it  is  observed  that  when  the  signal  is 
,  2 

weak,  the  error  is  limited  by  the  variance  Og.  But  this  is 
not  the  same  result  expressed  by  Eq.  (4-69) ,  and  therefore 
the  Cramer-Rao  bound  becomes  meaningless  in  such  a  case. 

In  conclusion,  the  analysis  made  in  the  Gaussian  noise 
context  indicates  that  under  thermal  noise  limited  condi¬ 
tions,  measurement  of  a  phasefront  cannot  be  made  accurately 
except  for  large  signal-to-noise  ratios.  For  low  light 
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levels  conforming  to  this  restriction,  the  photon  counting 
technique  is  quite  acceptable,  but  when  the  signal  level 
permits,  a  current  measurement  is  most  appropriate.  At 
this  point,  a  verification  of  the  algorithms  derived  in 
Eqs.  (4-39)  and  (4-48),  as  well  as  a  performance  evaluation, 
is  overdue.  This  was  done  in  a  computer  simulation,  and  is 
presented  in  the  following  section. 


Verification  of  the  MAP  and  ML  Estimators 

A  simulation  program  to  test  Eqs.  (4-39)  and  (4-48)  was 
written  in  FORTRAN  and  run  on  the  CDC  6600  computer  system. 
After  the  parameters  for  the  simulation  have  been  input, 
the  program  generates  the  signal  counts  Sj(6),  computed 
from  the  true  phase  input  and  Eq.  (3-24),  and  adds  white 
Gaussian  noise  counts  n^ ,  computed  from 

P.  NT  1/2 

n.  =  -J-  [-^1  ,  (4-96) 

3  q  2k 


where  P .  is  a  number  from  a  zero-mean,  unit-variance, 

3 

Gaussian  random  number  sequence,  generated  using  a  subrou¬ 
tine  from  the  IMSL  library.  The  program  then  estimates  the 
phase  based  on  the  noise-corrupted  measurements  r^  using 
Eqs.  (4-39) ,  (4-41)  and  (4-48)  as  needed. 

The  parameters  chosen  for  this  simulation  are  a  25 
kilohertz  modulation  frequency,  a  0.04  millisecond  observa¬ 
tion  period  (required  by  the  choice  of  N  =  1  in  Eq.  (3-22) ) 
and  a  signal  level  in  the  order  of  1  microamp.  Although 
the  algorithm  is  independent  of  signal  level  (also  verified 
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by  using  10  milliamps)  this  number  was  entered  to  simulate 
the  output  of  a  detector  under  low  signal  levels.  The  num-- 

p 

ber  of  counts  r^  thus  generated  are  of  the  order  of  10 

photoelectrons  and  below.  In  the  0-04  millisecond  observa- 

•  13 

tion  time  used,  this  corresponds  to  less  than  10  photo- 

electrons  per  second,  a  rate  which  is  in  the  category  of 

low  photon  coherence.  The  simulation  results  are  contained 

in  Figures  10-19. 

Simulation  Results  for  the  MAP  Estimator.  The  I«1AP 

estimator  simulation  is  a  series  of  tests  performed  on  Eq. 

(4-39)  to  verify  its  performance  as  an  estimator.  In  these 

tests,  Eq.  (4-39)  was  used  to  estimate  a  )cnown  phase  given 

different  signal-to-noise  ratios  following  the  procedure 

outlined  in  the  previous  section.  The  tests  were  then 

repeated  to  estimate  different  phase  angles  at  a  fixed  SNR. 

Several  such  tests  were  performed  and  the  results  presented 

here  are  typical.  The  first  set  is  shown  in  Figure  10. 

These  curves  were  obtained  using  a  phase  variance  of  0.8 
2 

rad  ,  chosen  as  the  safe  maximum  deviation  for  which  Eq. 
(4-38)  can  be  used  to  obtain  Eq.  (4-39);  and  using  a  test 
phase  of  0.5234  radians  (30°),  which  seemed  a  logical  choice. 
The  five  curves  shown  are  the  phase  estimates  plotted  as  a 
function  of  the  algorithmic  form  used  (determined  by  k)  for 
the  indicated  signal-to-noise  ratios  (10  dB  to  30  dB) .  The 
same  noise  counts  were  used  in  each  curve. 

From  this  and  other  tests  performed  (by  changing  the 
noise  seed  to  generate  Pj)f  it  is  observed  that  the  estimate 
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Fig  10.  MAP  Simulation  Results  -  Single  Phase  at  Different  SNR' 


deviations  are  larger  for  lower  values  of  k,  but  improve  as 
the  SNR  increases.  Thus,  for  large  SNR's,  consider  that  to 
be  50  (17  dD)  and  above,  the  benefit  of  the  simpler  algo¬ 
rithm  forms  in  Table  I  can  be  used  to  an  advantage  (less 
computation  time) •  The  same  is  true  for  the  ML  approach 
(see  Table  II) .  The  variance  of  these  estimates  is  shown 
in  Figure  11. 

Figure  12  is  a  plot  of  the  estimates  of  five  phase 
angles  obtained  with  a  SNR  of  17  dB.  They  range  from  0.2 
radians  (11.5°)  to  1.4  radians  (80°).  No  significant  dif¬ 
ferences  in  the  pattern  are  observed,  A  small  discrepancy, 
however,  had  to  be  fixed  in  this  particular  plot;  the  phase 
estimate  ^(r^)  for  0  =  1.4  radians  made  by  using  four  sub¬ 
intervals  had  a  negative  sign.  This  occurred  both  in  the 
MAP  and  the  ML  test.  However,  it  cannot  be  inferred  that 
the  estimator  is  more  sensitive  as  the  phase  becomes  larger. 
The  variance  of  these  estimates  is  shown  in  Figure  13.  Both 
Figures  11  and  13  seem  to  afirm  the  theoretical  prediction 
of  Figure  7  :  improvev..  performance  as  k  (and  SNR) 

A  test  to  check  the  effect  of  a  smaller  a  priori  phase 
2 

variance  Og  was  also  run  and  is  shown  in  Figure  14.  The 

2 

variance  was  chosen  to  be  0.274  rad  (the  square  of  the 
test  phase) .  A  slight  improvement  was  obtained  with  respect 
to  the  curves  of  Figure  10.  The  comparison  can  be  made  more 
easily  by  looking  at  the  estimates  variance  shown  in  Figure 
15.  Here,  the  improvement  is  more  obvious  for  the  lower 
values  of  k,  and  in  particular  for  the  10  dB  curve;  that  is, 
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for  the  estimates  obtained  with  the  smaller  SNR  used.  This 
seems  to  confirm  the  discussion  pertaining  to  Eq .  (4-90), 

that  estimates  from  noisy  measurements  are  more  dependent 
on  the  a  priori  information,  a  useful  characteristic  of  the  MAP 
estimator.  Similar  tests  were  also  run  to  verify  the  ML 
algorithms,  and  are  discussed  in  the  following  section. 

Simulation  Results  for  the  ML  Estimator.  The  ML  esti¬ 
mator  simulation  is  a  series  of  tests  performed  on  Eq. 

(4-48),  similar  to  the  ones  performed  on  the  MAP  estimator. 

The  main  difference  lies  in  the  straightforward  computation 
given  by  the  inverse  tangent  form  of  Eq.  (4-48).  Here,  no 
a  priori  information  (phase  variance)  is  used.  The  results, 
shown  in  Figures  16-19,  are  surprisingly  close  to  the  MAP 
results  already  discussed,  especially  as  k  increases.  Thus, 
the  ML  approach  can  be  used  to  advantage  when  the  signal- 
to-noise  ratios  are  high  enough  to  perform  nearly  as  well  as 
the  MAP  estimator.  From  these  results.  Figures  10-19,  it 
can  be  seen  that  high  enough  may  be  10  dB  and  higher.  Thus, 
given  SNR's  better  than  10  dB,  the  ML  algorithm  of  Eq.  (4-48) 
seems  to  be  a  good  estimator.  Its  use,  when  warranted,  has 
the  advantage  of  avoiding  solving  equations  of  the  form  of 
Eq.  (4-41)  which  are  time  consuming.  The  MAP  estimator, 
on  the  other  hand,  is  more  useful  under  noisy  conditions, 
where  the  estimator  v/eights  more  heavily  on  the  a  priori 
information . 

In  the  analysis  presented  in  this  chapter,  the  ultimate 
noise  limited  condition  was  assumed:  that  of  the  detector. 
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Fig  16.  ML  Simulation  Results  -  Single  Phase  at  Different  SNR' 


simulation  Variance  -  Single  Phase  at  Different  SNR' 


6NR=E0 


I 
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Fig  18.  ML  Simulation  Results  -  Multiple  Phase  Angles  at  a  Fixed  SNR 
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Fig  19.  ML  Simulation  Variance  -  Multiple  Phase  Angles  at  a  Fixed  SNR 


In  reality,  and  particularly  for  low  light  levels,  v/hen  the 
signal- to-thermal  noise  energy  ratio  is  indeed  very  large 
as  may  be  the  case  of  the  CC  devices,  then  the  effects  of 
the  signal  shot  noise  predominate.  Therefore,  it  is  neces¬ 
sary  to  consider  the  predominant  shot  noise  case  in  order 
to  have  a  more  complete  analysis.  When  the  phase  0  is  the 
only  random  parameter  in  Eq.  (2-9)  and  consequently  in  Eq. 
(4-23) ,  the  problem  can  be  treated  in  the  Poisson  context. 
An  explicit  use  of  the  Poisson  statistical  description  of 
shot  noise,  ignored  by  Wyant,  will  be  the  basis  of  the 
developments  of  the  next  chapter. 


V  Phase  Estimation  in  Shot  Noise 


When  detector  thermal  noise  is  negligible,  the  process 
is  signal  shot  noise  limited.  For  a  multimode  field  (Ref  2; 
87-94,  212),  the  process  is  well  modeled  as  having  condi¬ 
tionally  Poisson  statistics.  When  the  field  is  single  mode, 
then  it  is  governed  by  Laquerre  (Ref  2:304)  statistics  and 
will  not  be  considered  here.  The  analysis  is  identical  to 
the  one  made  in  Chapter  IV  after  Eqs.  (3-32)  and  (3-33). 

For  multimode  detection,  the  counts  r.  in  the  observation 

3 

interval  (t^,t^+T)  are  independent  and  Poisson  distributed 
when  conditioned  on  the  field  intensity  (Ref  2:295).  The 
density  function  of  the  observables  in  (tj,tj+T)  conditioned 
on  6  is  the  probability  density  of  the  events  (photons 
received) ,  assuming  that  all  the  events  equal  the  observa¬ 
tions  (photoelectrons  produced)  in  each  observation  sub¬ 
interval  T.  Therefore,  the  density  function  of  the  shot 
process  is 


P[N^ 


(5-1) 


where 


,t  .+T 


E[r(t)  ] 


dt 


(5-2) 


The  observation  r(t)  has  the  form 


r(t)  =  s(t,0)  +  '  (5-3) 
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where  the  noise  count  (photon)  rates  have  the  following 

statistics:  (1)  the  signal  shot  noise  A  (t)  is  a  non- 

sn 

stationary,  zero-mean  process  with  covariance  A (t) 6 (t-t ' ) ; 

(2)  the  background  is  assumed  constant  and  infinite  with 

correlation  function  N^,  6  (r  -r^  '  )  <S  (t-t '  )  and  a  stationary 
count  rate  Aj^  =  ^Qj^^QD^n/hf^ ,  where  is  the  spectral 

background  noise  strength,  is  the  optical  bandwidth  and 
Dg  is  the  number  of  spatial  modes  (Ref  2:212,213,298);  and 

(3)  the  stationary  detector  dark  current  has  dc  content  A^ 
and  covariance  A^5(t-t'). 


Maximum  A  Posteriori  and  Maximum  Likelihood  Estimates 

With  the  above  conditions  established,  the  MAP  and  ML 
phase  estimates  can  be  found  using  Eq.  (5-1)  as  the  starting 
point.  Thus,  Eq.  (5-1)  becomes 


(s.(9)  +  (Aj^+A^)T/k)  ^ 


exp[-(Sj(e)  +  (Aj^+A^)  T/k)  ]  .  (5-4) 


In  a  manner  analogous  to  Eq.  (4-14),  the  observation  vector 
r  has  a  conditional  density  function  given  by 


k 
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Unlike  Eq.  (4-14),  Eq.  (5-5)  contains  a  constant  noise  term 
( that  represents  deviation  from  ideal  behavior. 
Use  of  Eq.  (5-5)  in  Eq.  (3-32)  yields  the  MAP  estimator 
equation 

k  r.  as.  (G)  ^ 

I  s.(e)  +  (x.+A  )T/k  ao  “  q  /  air  s(t/0)<3t 

J  i.  J  DU 

+  F(0)  =  0  ,  (5-6) 


where  the  substitution 


k  as.  (9) 
E  — i- 
j=l 


ae 


a  s (t, 0 ) 
ao  q 


dt  = 


as ( t , 0 ) 


ae 


dt 


(5-7) 


and  the  definition  of  Eq.  (4-26)  were  used. 

Use  of  Eqs.  (3-25) - (3-27)  and  (4-24)  in  Eq.  (5-6)  with 

r'^  a 

/  s(t,0)dt  =  0,  where  s(t,0)  is  given  by  Eq.  (2-9), 

O 

yields  the  MAP  estimate 


k  [a  .  sinO  (r) +6  .  cosO  (r )  ]  k/2TT 

Z  r  .  - - ^ - 

j  =  l  ^  [a  .  COS0  (r) -3  ■  sinO  (r)  ]  k/2Ti+ (a+i,  +i  , ) /b 
"  3  -  J  -  b  d  ' 

-  F(0)  =  0  ,  (5-8) 

where  the  substitution  (Ref  2:113)  (ij^+i^)  =  was 

made  to  convert  constant  photon  count  rates  to  dc  noise 
current.  The  ratio 


y 


A  b 
a  +  lb 


(5-9) 
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is  the  fringe  visibility  and  depends  on  the  radiance  of  the 


source.  The  dark  current  i^  is  usually  negligible. 

Equation  (5-8)  is  the  MAP  discrete  estimator  of  phase 
measured  under  Poisson  shot  noise  conditions.  It  can  be 
reduced  further  under  special  conditions,  but  before 
specializing  its  results  and  following  the  practice  of 
previous  sections,  Eqs.  (5-6)  and  (5-8)  can  be  put  in  the 
form  of  continuous  waveform  equations.  Thus,  in  the  limit 
as  k  becomes  very  large,  Eq.  (5-6)  becomes  (Ref  2:298) 

1  /  _ Lit) _  3s(t,0)  _  1  8  s  (t,0),  ^ 

q  /  s(t,0)  +  (i,+i,)  36  <3/36 

DO 

+  F(G)  =  0  ,  (5-10) 


and  the  MAP  phase  estimator  of  Eq.  (5-8)  becomes 


n 

•V-\ 


r  (t)  cos  (ujt+9  ) 


Iq  sin(ojt+0)  +  (a+ij^+i^)/b 


dt  -  F (0)  =  0  .  (5-11) 


The  first  case  to  be  considered  is  when  the  fringe 
visibility  is  approximately  unity.  This  is  more  or  less 
what  is  found  from  the  output  of  the  shearing  interferometer, 
as  can  be  determined  from  Eqs.  (2-10)  and  (2-11).  Thus, 
from  those  two  equations,  a/b  =  0.711;  so,  (a+ij^)  ~  b,  con¬ 
sidering  the  presence  of  a  small  background  current.  Thus, 
by  setting  (a  +  ij^fi^)/b  =  1,  Eq.  (5-8)  can  be  transformed 
using  simple  trigonometric  identities  (Ref  11:225-241)  into 
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E  r.{tan[sin  -  «  (r)  ] 

:  D  D  J  ~ 

-1 

+  2tt/ [  (a  j sinG  (r )  +  P ^ cosO  (r )  ) /k]  } 

-  F(0)  =  0 

for  Y  =  ==  1 

and  Y  >>  »  (5-12) 

where  the  substitution 

tan"^(aj/Bj)  =  sin~^  (a^ Aa^  +  Bj)  (5-13) 

was  made  to  avoid  dividing  by  zero  as  some  B^'s  are  found  to 
be.  Equation  (5-12)  is  therefore  the  discrete  MAP  phase 
estimator  given  Poisson  shot  noise  conditions  and  equal 
amplitude  dc  and  ac  components.  It  is  noteworthy  to  observe 
that  if  k  is  large,  the  second  term  in  Eq .  (5-12)  can  be 
neglected  so  the  estimate  becomes  approximately 

k  - - 

E  r.cot[sin  ^(a./y/a^  +  6^)  -  9  (r)  ] 
j  =  1  ^  3  3  3 

-  F(0)  =  0  ,  k  large  .  (5-14) 


Finally  as  k  becomes  very  large,  from  Kq.  (5-11),  the  con¬ 
tinuous  measurement  estimate  is  found  to  be 
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0 


T 

/ 


r  (t) cot ( 


'ot  +  0 


dt  -  q  F  ( 0 ) 


for 


Y  +  i. 


1  /  Y  >>  i 


d  • 


(5-15) 


The  second  case  to  be  considered  is  when  (a+i^^+i^)  =  0 ,  dc 
components  blocked  (capacitively  perhaps).  For  this  case, 
Eq.  (5-8)  becomes 


k  a.sijiO(r)  +  ;3,cos0{r) 

1  r  .  ^ - 

“1  /N 

j  =  l  UjCOSU  (r)  -  i3jSinC(r) 


-  F('l)  =  0  . 


(5-16: 


Equation  (5-lG)  can  be  also  transformed  using  simple  trigo¬ 
nometric  identities  into 


Z  r, tan [sin  ^ ( 6 • A u  ^  +  6  ^ )  +  0  (r) ] 
j=l  J  1  J  3 

-  F(0)  =  0  ,  dc  blocked.  (5-17) 


This  is  the  discrete  RYF  phase  estimator  given  Poisson  shot 
noise  conditions  and  dc  components  (signal  and  noise) 
blocked.  In  the  limit  as  k  becomes  large,  the  continuous 
measurement  estimators  found  to  be  from  Eq .  (5-11) 

T 

r(t)cot(ojt  +  6)dt-qF(0)  =  0  , 

dc  blocked  .  (5-18) 


The  last  case  to  be  considered  is  for  a  very  low  fringe 
visibility  such  that  (a  +  ij^)  b.  This  condition  is  given 
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under  a  strong  infinite  background  competing  with  the  source 
(target),  and  is  referred  to  as  background  linitod.  Thus, 
for  background  limited  conditions, 

(a  +  ij^  +  i^)/b  >>  [ajCOsG(r)  -  6  ^  sinO  (r )  ]  k/2ii 
and  Eq.  (5-8)  becomes 


I  r.(a.sinG(r)  +  iS  .cosO  (r)  ] 
j=l  3  1  3 


271  (a  +  ij^+i^) 

MS 


F(0)  =  0  . 


(5-19) 


Equation  (5-19)  is  of  the  same  form  as  Eq.  (4-36)  with 

the  term  2:7  (a+i^+.i^) /kb  in  place  of  TiN^/qkb,  Thus,  an  equiv- 

alcnt  noise  can  bo  defi.nod  as  N  —  2q(a+i.  +i,).  The  phase 

o  D  d 

estimator  in  multimode  shot  noise  under  background  limited 
conditions  is  thus  the  same  as  the  estimator  obtained  in 
the  Gaussian  analysis  with  equivalent  noise  2q  (a+ij^+i,^)  . 
Therefore,  if  F(.')  =  -  0/Og/  the  estimate  is  after 

Eq.  (4-39) 


^  (r) 


for  i, 
b 


>  >  b  , 


kbOg  k 

— 7 - — : — T  {-  f  5.r,cosi  (r, 

27.(a.rj^+rj)  g  j 


y.  u  .  r  .  si.nO  (r )  )  , 

j  =  l  3  i 


(5-20) 


and  the  ML  c.stimato  is  the  s^imo  one  eivon  by  I’.q .  (4-43)  . 

In  order  to  illustrate  how  Eqs.  (5-12)  and  (5-17)  cliango 
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with  k  and  to  compare  them  to  Eqs.  (5-20)  and  (4-48) ,  some 
examples  are  given  in  Tables  IV  and  V. 


Performance  of  the  Phase  Estimators  in  Shot  Noise 

Considering  that  the  phase  estimates  found  in  the 
Gaussian  noise  analysis  are  asymptotically  unbiased  for 
high  SNR,  the  Cramer- Rao  inequality  seems  to  be  also  an 
appropriate  measure  of  performance  of  the  shot  noise  phase 
estimator  under  the  same  SNR  restrictions.  From  Eqs.  (4-78) 
and  (5-6) ,  the  MAP-CRB  bound  is  given  by 


CRB  = 


k 

Z 

Lj=i 


"1  \-i 


9F(e) 


s  .  (0)  + 


(Ab+Xd)T/k 


30 


.  (5-21) 


By  making  the  substitutions  of  Eqs.  (3-25)  and  (4-85)  into 
Eq.  (5-21) ,  the  bound  transforms  into 


CRB  = 


bT 


- 


k 

Z 


[g  jSin9+8  jCos9 ]  ^k/2Tr 


I  [^2T7q  [g  jCOS0-8  jSin9  ]  k/2Tr  +  (a+ij^+i^)  /b  J 

"9F(9)1  ( 


39 


i 


(5-22) 


For  Gaussian  phase,  Eq.  (5-22)  becomes 


CRB  =  !  7^  E 


I  2irq 

4  i  M 

\5iq  *o  2  I 


_ _  _  ^  1_\ 

(g  .COS9-B  .sin9)  +2tt  (a+i, +i  ,)  /k.  1  21 

Ld=1  3  3  b  d  bj  Oq  ) 


k 

Z 


(gjSin9+6 jCOs9 ) 


-1 


"9 

1  +  o^bTA  /2T7q 
0  o 


(5-23) 
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TABLE  IV 


Examples  of  the  MAP  Phase  Estimator  Form 
Under  Multimode  Shot  Noise  Conditions 


DC  Blocked,  Eq.  (5-17),  F(0)  =  -  e/a^ 


B 

e  (r^^  ...  rj^) 

3 

0g{rj^cot0  (r)  -  (r2+r2)  tan  (0  (r)  +  ^)  1 

B 

Ogf (ri+r2)cot0 (r)  -  (r2+r^ ) tan0 (r) } 

09 

T 

Oq  f  dt  r  (t)  cot  (a)t+0) 

Background  Limited,  Eq.  (5-20),  F(9)  =  -  Q/Og 


B 

0  (^2^  •  •  • 

3 

B 

/2  (r^-r2)kj^cos0  (r)  -  /2  (r2-r^) k^sin§  (r) 

00 

^  T  ^  •  T  A, 

[J  r (t) cosutdt] k  cos0-[J  r(t)sinutdt]k  sin0 

0  ^0  ^ 

TABLE  V 


Examples  of  the  ML  Phase  Estimator 
Under  Multimode  Shot  Noise  Conditions 


DC  Blocked,  Eq.  (5-17),  F(e)  =  0 


a 

tan6  (r^^  . . .  r^^) 

3 

rj^  /3  -  tan6  (r) 

/3  +  tan§ (r) 

4 

>1  -  ^3 

V  r2  +  r4 

00 

T 

f  r  (t)  cot  (a)t+6  )  dt  =  0 

O 

Background  Limited,  Eq.  (4-48) 


B 

tan§  (r^  . . .  r^^) 

3 

/3  ^^l"^2“^3 

3  ''2-'3 

4 

^1  “  ^3 
^2  ■  ^4 

00 

J 

J 

r’’ 

'  r(t)cosijotdt 

0 

r(t)sina)tdt 

0 
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tane  (r^  ... 


rj^/ (sine  (r) +1. 11)  -  r^/ (sin6  (r) -1. 11) 


r2/ (cose (r) +1. 11)  -  r^/ (cosS (r) -1. 11) 


/  dt  r{t)cot(5;i^  +  1)  =  0 


/ 


where  is  the  expected  value  of  the  sum  with  respect  to  9 . 
For  (a+ij^+i^)  =  b  and  k  very  large,  Eq.  (5-23)  reduces  to 
(Ref  2:300) 


V  (e  )  > 

ar'  r' 


1  +  OgbT/q 


The  ML  bound  is  on  the  other  hand  (Og  large) 


(5-24) 


'^ar^^r^  -  M  ' 


(5-25) 


the  reciprocal  of  the  collected  count  energy.  The  perform¬ 
ance  of  Eq.  (5-20)  as  given  by  the  CRB  bound  is,  from  Eq. 
(5-22)  with  F(e)  =  -  9/0^, 


CRB  = 


(bT  1 

_  2  n 

k  (a  .sin6+6  .cos9)  k/27r 

V  3  D 

.if 

|2TTq  ^1 

.j=l  (a+ij^+i^)/b 

°6* 

(5-26) 


Invoking  the  identities  of  Eqs.  (4-29)  and  (4-30) ,  Eq. 
(5-26)  simplifies  to 

.-1 


CRB 


b^T  k  ^  2  1  I 


(5-27) 


Finally,  the  definitions  of  Eqs.  (4-59) ,  (4-60)  and  (4-77) 
can  be  used  to  write  the  error  variance  as 


E[e‘]  ^ 


1  +  Ego2/(2Ti’2q(a+ij^+i^)Yj,] 


(a+ij^+i^)  >>  b  , 


(5-28) 


84 


or  in  the  limit  as  k  becomes  very  large 


l.i.m.  E[cJ]  >  - - -  ,  i,  >>  b  .(5-29) 

k  -»■  ®  1  + 


Because  of  the  requirement  that  ij^  >>  b,  small  SNR,  the 
error  is  limited  by  the  variance.  Furthermore,  the  actual 
error  variance  may  be  much  higher  due  to  the  inherent  SNR 
restrictions  needed  to  apply  Eq.  (5-22) .  For  the  ML  esti¬ 
mate,  Eq.  (5-28)  becomes 

2 

271  q(a+i,+i,) 

Var(er)  2  - 2 -  ,  ij^  >>  b  ,  (5-30) 

a 

and  Eq.  (5-29)  becomes 


1 .  r .  in . 
k  -»■  <» 


q(a+ij^+i^) 


ib  >  b 


(5-31) 


Equation  (5-31)  tells  that  the  ML  estimate  obtained  from 
Eq.  (4-48)  ,  when  used  as  an  approximation  for  background 
limited  conditions,  results  in  a  bad  estimate. 

The  foregoing  discussion  is  based  on  Wyant's  assertion 
(Ref  16:2624)  that  fox'  shot  noise  limited  conditions,  a 
better  method  for  measuring  phase  is  as  given  by  Eq.  (5-19) , 
F(0)  =  0,  easily  implemented  by  letting  k  =  4.  (See  Table 
V,  Background  Limited.)  The  performance  of  Eq.  (4-48) 
analogous  to  Eq.  (5-19)  for  ML,  was  proven  to  be  exclusively 
SNR  dependent.  The  equivalent  noise  2q(a+ij^+i^)  also  makes 
the  result  of  Eq.  (5-19)  SNR  dependent,  and  the  estimate  is 
likely  to  be  erroneous.  (See  Eq.  (5-31).)  A  better 
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t 


equation  is  given  also  in  Table  V  for  y  =  1/  but  the  solu¬ 
tion  is  not  so  straightforward.  The  analysis  performed  by 
Wyant  gives  an  error  variance  (dark  current  neglected) 


Var'^r'  ’  <''♦1^ 


(5-32) 


4y"  Z  r. 
j=l  ^ 

where  the  photon  counts  can  be  evaluated  from  external 
parameters.  Comparison  of  Eg.  (5-32)  to  Eq.  (5-30)  using 
Eq.  (5-9)  gives  the  following  inequality: 

Tr^(a+i,  )^  4Tr^q(a+i.) 

- k -  -  - k  '  \  ^  •  (5-33) 


4b^  Z  r. 
j=l  ^ 


-  9 

b^T  k  Z 

j=l  ^ 


For  the  particular  case  of  k  =  4,  Eq.  (5-33)  yields 


(a+ij^)  k  |(rj^+r2+r3+r4)  >>  b  . 


(5-34) 


From  Eqs.  (5-32)  and  (5-9),  Wyant's  approach  is  found  to  be 
also  SNR  dependent.  Any  good  performance  thus  depends  on 
the  condition  that 


Z  r.  >>  (a+i  ) 

j=l  3  ^ 


(5-35) 


From  Eq.  (5-34),  obviously  Eq.  (5-35)  is  not  true. 

The  foregoing  results  clearly  indicate  that  phase  esti¬ 
mation  requires  a  strong  signal  regardless  of  the  noise 
process.  It  can  be  noted  that  under  low  light  level  condi¬ 
tions,  the  SNR  constraints  are  more  difficult  to  meet,  and 
large  errors  may  be  expected.  With  these  conclusions,  the 
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analysis  of  phase  difference  estimation  from  measurements 
from  a  single  detector  are  completed  within  the  scope  of 
this  thesis.  Given  the  added  complexity  of  Eq.  (5-8)  and 
time  constraints,  a  simulation  as  performed  in  Chapter  IV 
is  not  included.  Estimation  of  the  actual  phase  components 
^  in  Eq.  (2-3)  will  be  performed  in  the  next  chapter  using 
the  measurements  of  the  detector  array  as  a  whole. 
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VI  Joint  Processing  of  Array  Signals 
for  Wavefront  Estimation 

The  problem  of  phase  difference  estimation  was 
addressed  in  Chapters  IV  and  V.  Ideally  what  is  desired 
are  the  actual  phases  at  specific  points  across  the  aper¬ 
ture  rather  than  slopes  between  points.  The  methods  usually 
employed  to  find  these  phases  consist  of  mappings  of  data 
given  by  the  measured  wavefront  difference  functions  A({)  (r  ) 

Si 

and  the  geometry  of  the  data  points.  These  mappings  use  the 
concepts  of  least  squares  fitting  and  are  discussed  by 
Fried  (Ref  1) ,  Hardy  et  al  (Ref  4) ,  Hudgin  (Ref  5) ,  Rimmer 
(Ref  10),  and  Wyant  (Ref  16)  among  others.  The  general 
result  given  by  these  mappings  is  an  average  of  phases  and 
measurements  about  a  single  point,  requiring  a  recursive 
solution. 

A  different  approach  will  be  used  in  this  thesis  which 
uses  the  fact  that  the  phases  across  the  aperture  are  spa¬ 
tially  correlated  due  to  slow  spatial  variation  of  the 
wavefront,  and  the  assumption  that  a  spatial  covariance 
matrix  is  available  from  experimental  measurements.  There¬ 
fore,  by  jointly  processing  the  outputs  of  the  two  detector 
arrays  discussed  in  Chapter  II,  a  phase  estimate  can  be 
made  in  real  time.  Because  of  the  a  priori  information 
supplied  by  the  covariance  matrix,  an  improvement  is  expec¬ 
ted  over  mappings  of  data  points. 

The  criteria  of  Maximum  A  Posteriori  estimation  will 
be  used  in  this  chapter,  but  the  algorithm  so  obtained  will 


88 


be  in  continuous  waveform  rather  than  discrete  counts. 
Because  of  the  difficulty  in  applying  the  MAP  theory,  sev¬ 
eral  assumptions  need  to  be  made  in  order  to  simplify  the 
algorithm  derivation.  The  problem  will  be  worked  out  in 
the  Gaussian  context  or  second  moment  models  only,  where 
detector  thermal  noise  is  the  predominant  noise  source.  The 
noise  waveforms  from  each  detector  are  samples  from  indepen¬ 
dent  white  Gaussian  processes  with  zero-mean  and  strength 
N|^/2.  The  subscript  k  will  be  used  to  index  the 
detector.  The  phases  4){r,t)  will  be  assumed  to  be  stepwise 
constant  in  the  interval  sequence  (O,  T,  2T,  ...)  as  pre¬ 
sented  in  Chapter  I.  The  covariance  matrix  is,  therefore, 
constant  in  each  interval ,  but  needs  to  be  updated  every  T 
seconds.  This  is  the  sequential  problem  which  is  beyond 
the  scope  of  this  thesis.  A  suggestion,  however,  will  be 
given  later  for  sequential  estimation  by  quantizing  0(r,t) 
when  it  is  continuous  in  time.  The  final  assumption  to  be 
made  is  that  of  a  jointly  Gaussian  random  phase  distribution 
over  the  aperture.  The  time-space  problem  fitted  to  the 
above  description  is  one  of  multiple  channel,  multiple 
parameter  estimation  in  Gaussian  noise. 

Multidimensional  Estimator  Formulation 

Derivation  of  the  required  time-space  estimator  equa¬ 
tion  will  be  made  in  this  section.  The  next  section  will 
treat  the  specific  application  to  the  shearing  interferome¬ 
ter  output.  So,  the  multiple  observation  model  can  be 
written  in  vector  form  as 


r(t)  =  s(t,a)+n(t),  0<t<T, 


(6-1) 


where  r{t)  is  a  column  vector  of  the  outputs  from  a  two- 
dimensional  detector  array,  indexed  with  a  single  subscript 
1  <  K  <  m,  A  vector  of  phase  parameters  a  over  the  aper¬ 
ture,  indexed  also  with  a  single  subscript  1  <  p  ^  n,  is  to 
be  estimated  jointly  using  all  available  outputs.  T)ie  for¬ 
mulation  needed  for  array  processing  is  a  direct  extension 
of  the  single  element  case.  Let  the  output  from  the 
detector  be 


r^(t)  =  s^(t,a)  +  n^(t) 


(6-2) 


The  noise  statistics  are  given,  from  Eqs.  (4-2)-(4-6),  as 


E[n^(t)]  =  0 


E[n^(t)n^(f)]  =  ^^6(t-t') 


(6-3) 


w 

A  K 


6(t-t’)  . 


(6-4) 


It  is  assumed  that  the  noises  in  the  detectors  are  statis¬ 
tically  independent  of  each  other.  The  detector  outputs  r^ 
are  also  statistically  independent  as  was  shown  in  Eqs. 
(4-9) - (4-13) .  Therefore,  an  array  output  vector  can  be 
defined  as 


B  • 


.  r  . . .  r  ] 

—  K  -m 


(6-5) 


1 


and  the  conditional  density  of  the  observations  given  the 
phase  vector  is  found  from 


m 

n 

K  =  1 


(r,  !a) 


-K  I  — 


(6-6) 


where  f  i  is  given  by  Eq-  (4-14'  The  likelihood 

— K  '  — 

function  defined  by  the  ratio 


A 


A 


(6-7) 


is  therefore  given  by 


A  =  ^ 

T 


m 

Z 


k 

Z 


K=1  j=l 


r  . 
1 


(^  ) 


W 


(a)  -  2^-  (s]^^  (a)) 2 


(6-8) 

Equation  (6-8)  must  be  maximized  by  the  proper  choice  of 
all  the  elements  a^  in  a.  Since  the  phases  across  the  aper¬ 
ture  are  assumed  jointly  Gaussian  and  spatially  correlated, 
they  can  be  represented  in  a  different  coordinate  system 
where  the  new  elements  a^^  are  independent  Gaussian  random 
variables,  each  with  density 


f  (a.  ) 
a .  1 

1 


[2110^  ]  ^'^^exp  [-a?/2o^  ] 
1  1 


(6-9) 


Beginning  the  estimator  derivation  with  Eq.  (6-9) ,  the  MAP 
estimate  of  a.  can  be  obtained  by  maximizing  {A  +  Inf  (a.)} 

X  3.  .  X 

1 

with  respect  to  a^.  The  result  in  vector  form  is 


_  2,  k  3s^  (a) 

^  V  3 
T  3aj  -  ^-j  -j 


W  ^ [r  .  -  s . (a)  ]  - 


a . 
1 


=  0 


(6-10) 
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By  using  Eqs.  (6-14)  and  (6-15),  Eq.  (6-10)  becomes 


a .  = 

1 


I  ({)Td.  (a)w‘^[r  .  -  s  .  (a)  ] 
T  a^  - '■-j  -j 


(6-16) 


By  using  Eq.  (6-16)  in  Eq.  (6-12) ,  the  joint  estimate  is 


a  = 


2q^k 


^2  T 

1.  JL.m.  E  a  di .  d>-. 

N  ->  00  i=i 


.,-1, 


E  D. (a)W  [r .  -  s . (a) ] 
j=l  -3  -  -3  -3  - 


The  covariance  matrix  is  given  by  (Ref  12:222) 

2  T 

k,  =  l.i.m.  E  (p.^T 

^  N  -»■  “  i=l  1  1 


(6-17) 


(6-18) 


and  is  assumed  to  be  known  in  the  interval  (tj^,tj^+T).  It 
is  further  defined  by 


k  -  E[a  a"^] 

”“3  ” 


k,  .  . .  k, 

1  In 


k  ,  k 

nl  n 


(6-19) 


Therefore,  the  discrete  joint  MAP  estimate  of  a  is  given  by 

a(r)  =  k  E  D.  (a)W  [r  .-s  .  (a)  ]  ,  t.  i  t  <  t,+T  . 

—  —  ■*•  — 3  j_i  — J  —  —  — J  — J  —  -*• 

(6-20) 

Equation  (6-20)  can  be  converted  into  a  continuous  form  by 
substituting  for  the  definition  of  Eq.  (3-14) .  By  observing 
that,  for  a  set  C (t)  of  complete  orthonormal  functions, 
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a 


6 (t-u) 


(6-21) 


k 

l.i.m.  Z  C.  (t)c-(u)  = 

00  j  =  ]_  J  -> 

then  Eq.  (6-20)  becomes 


a  =  2k 


ISa  J  D(t,a)w“^(r 


(t)  -  s (t ,a) ] dt 


for  0  <  t  <  T  , 
where 


(6-22) 


D(t,a)  - 


ds^ (t,a) 
9a, 


9Sj^  (t,a) 

dcL 


9Sj^(t,a) 

az; 


9s^(t,a) 


(6-23) 


n 


Equation  (6-22)  is  the  joint  MAP  'estimate  from  a  continuous 
measurement  of  a  Gaussiai:  random  vector  a  observed  in  Gauss¬ 
ian  noise.  This  is  the  result  that  will  be  used  in  the 
forthcoming  developments  to  process  jointly  the  outputs  of 
a  shearing  interferometer.  Equation  (6-22)  can  also  be 
obtained  in  a  manner  analogous  to  the  single  detector  by 
performing  a  correlation-summing  operation  as  shown  in 
Figure  20  (Ref  14:367,452,453). 


Wavefront  Estimation  from  the  Shearing  Interferometers 

The  output  fields  of  the  interferometers  at  the  focal 
planes  are  received  and  processed  by  two  separate  detector 
arrays,  one  each  for  the  X  and  Y  sheared  fields  as  depicted 
in  Figure  2.  Therefore,  it  will  be  convenient  to  keep 
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Fig  20.  Correlator-Summer  Model  for  Joint  Processing 

track  of  the  observations  by  use  of  appropriate  subscripted 
notation.  Define 


tor  in  each  of  the  x-  and  y-arrays.  The  subscripts  <  and  il 
index  the  rows  and  columns  respectively.  For  any  one  detec¬ 
tor,  the  signals  can  be  written  as 

y(t,d)  =  a  +  b  sin (wt  +  0 (y  ) ) 


for  fixed, 

a 


(6-26) 


1 


and 


x(t,0)  =  d  +  c  sin(wt  +  6 (x  )) 


for  fixed, 

a 


(6-27) 


where  the  difference  functions  are,  from  Eqs.  (2-3)  and  (3-1) 
(3-5) ,  given  by 

e(x  )  =  I  {4)(x  -  M  ,y  )  -  (j.(x,  +  ,y^)}  (6-28) 


a  s^'^a 


a 


0  (Ya)  =  (<J>(x^,y^  "  ^  ~ 

d  ‘’d 

Further  notational  simplification  can  be  made  by  defining 


<t>  (x“) 

A 

<t.(Xa  - 

(6-30) 

(x'*’) 

A 

(6-31) 

<l>  (y") 

A 

(6-32) 

+~' 

-e- 

A 

(6-33) 

so  that  Eqs.  (6-26)  and  (6-27)  can  be  written  as 


y(t,9)  =  a  +  b  sin(a)t  +  ^  ^  ^ ) 


for  X  fixed, 
a 

and  x(t,6)  =  d  +  c  sin(wt  +  - — 2"^'~  ^ 


for  y  fixed, 
a 


(6-34) 


(6-35) 


Equation  (6-22)  can  now  be  applied  using  Eqs.  (6-24)  through 
(6-35)  to  perform  the  joint  estimate  of  the  wavefront  phases 
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using  two  plane  detector  arrays  properly  interfaced  to 
couple  the  x-  and  y-measurements  with  the  same  wavefront 
points.  But  the  two-array  configuration  must  be  set  up 
before  proceeding  to  perform  the  joint  phase  estimation. 

Configuration  of  the  Detector  Arrays.  The  need  for  the 
use  of  two  detector  plane  arrays  for  actual  phase  estimation 
is  a  consequence  of  the  structure  of  the  difference  func¬ 
tions  in  Eqs.  (6-30)  through  (6-35),  obtained  from  the  use 
of  lateral  shearing  instead  of  radial  shearing  interferom¬ 
etry.  As  is  the  case  in  the  approach  of  data  mappings,  pro¬ 
cessing  of  the  observations  r(t)  requires  a  specific  detec¬ 
tor  arrangement.  In  order  to  make  full  utilization  of  the 
information  collected,  the  detectors  (and  the  arrays)  must 
be  arranged  so  that  each  wavefront  point  be  measured  by  as 
many  detectors  as  possible  in  order  to  provide  a  strong 
deterministic  relationship  between  the  lu  >asurements  over  the 
aperture.  All  other  coupling  is  provided  statistically  by  the 
covariance  matrix  of  the  phases. 

In  order  to  provide  redundancy  of  measurements,  the 
best  possible  detector  arrangement  is  as  shown  in  Figure  21. 
This  arrangement  allows  each  phase  point  to  be  measured  by 
four  detectors,  two  from  each  one  of  the  arrays,  and  has 
the  advantage  that  only  one  reference  phase  is  required  to 
determine  the  entire  phase  distribution.  It  is  assumed  that 
such  phase  point  is  measured  by  another  means  or  is  set 
arbitrarily  equal  to  zero.  The  X's  and  Y's  in  Figure  21 
denote  the  detector  locations  and  the  4i's  denote  the  phase 
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Fig  21.  Arrangement  of  Two  Detector  Arrays  to  Measure 
(m+l)x(n+l)  Phase  Points  with  (m+l)xn  X-  and 
mx(n+l)  Y-Detectors 


points  being  measured.  Except  for  the  edge  phase  points, 
each  phase  <p^  ^  is  measured  by  detectors  y^^^, 

and  y^  „  , ,  being  a  common  phase  point  of  the  four  dif- 

ference  functions  r  and 

The  grid  shown  in  Figure  21  is  formed  by  overlapping  the  two 
arrays  to  indicate  the  relative  positions  of  the  X-  and  Y- 
shear  detector.  The  corresponding  detectors  x^  ^  and  y^^^  are 
not  located  at  the  same  point  on  the  field.  They  are  dis¬ 
placed  45®  instead  so  that  both  can  measure  the  same  phase 
The  direction  of  the  shear  has  been  chosen  from  right 
to  left  and  bottom  up  to  correspond  to  the  notation  adopted 
in  Eqs.  (6-34)  through  (6-37).  In  this  arrangement  the 
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detector  spacing  has  been  chosen  so  that 


(6-36) 

and 

(6-37) 

in  Eqs.  (6-30)  through  (6-35) .  The  detectors  must  then  be 
spaced  2M  shear  distances  s^  apart  and  s^^  may  not 

necessarily  be  equal) .  This  is  depicted  in  Figure  22  for  a 
column  array. 

In  general,  for  a  non-square  array  of  phase  points,  the 
number  of  detectors  is  different  in  each  array.  Thus, 
referring  to  Figure  21,  there  are  [m+l]xn  X-detectors  and 
mx[n+l]  Y-detectors,  and  [m+1] [n+1]  measured  phases.  The 
ratio  of  detector  to  phase  points  is 


[m+1] xn  +  nx [n+l] 
[m+1] X [n+l] 


and  ranges  from  unity,  when  a  minimum  of  four  phases  are 
measured  with  two  each  X-  and  Y-detectors,  up  to  the  value 
of  two,  in  the  limit  when  a  very  large  number  of  phase 
points  are  being  measured.  Thus,  in  the  best  possible 
arrangement  of  Figure  21,  the  number  of  detectors  required 
tends  to  double  as  more  wavefront  points  are  measured  simul¬ 
taneously.  With  these  preliminaries  completed,  derivation 
of  the  algorithm  for  wavefront  estimation  by  jointly  pro¬ 
cessing  the  two  detector  array  outputs  can  be  initiated. 

Joint  Wavefront  Estimation  with  Two  Detector  Arrays. 

The  observations  r(t)  can  be  arranged  in  a  column  vector  of 
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Fig  22.  Arrangement  of  Detector  Column  Array  for 
Joint  Processing 


dimension  (m[2n+l]+n),  using  the  notation  of  Eqs.  (6-24)  and 
(6-25) .  Thus / 


r(t)  (t)  (t)  ...  /Wj^j^(t) 


(6-36 


where  z^^(t)  and  w^^(t)  are  the  observations  with  signal 
components  given,  after  Eqs.  (6-34)  through  (6-37),  by 

4>, 

for  0  <  t  <  T, 

1  i  K  <  m, 

1  <  X.  <  n+1. 


y.o(t,i)  =  a^^  +  b^^sin(a)t  + 


100 


(6-39) 


£,  +  1 

and  sin(oJt  +  — ^ - ~'> 

for  0  <  t  <  T, 

1  <  K  <  m+1, 

1  <  «,  <  n  .  (6-40) 

The  arguments  (x^)  and  (y^)  used  in  Eqs.  (6-34)  and  (6-35) 
are  dropped  from  Eqs.  (6-39)  and  (6-40)  since  the  ambiguity 
is  taken  care  of  by  the  subscripts  K+l,i  and  k,£+1,  and 
because  (t)^g^(x'^)  =  <}>  (y'*')  • 

The  phase  vector  is  also  a  ([m+l][n+l])  column  matrix 


=  [4> 


11 


pq 


^m+l,n+l 


(6-41) 


and  has  a  ([m+l](n+l])  symmetrical  covariance  macrix  with 

terms  from  to  Vl,n+l;m-M,ntl-  covariance 

matrix,  on  the  other  hand,  is  (m[2n+l]+n)  diagonal  (noises 
were  assumed  spatially  uncorrelated)  given  by 


W 


M 


11 


O 


O 


N  ...1 
n,n+l 


(6-42) 


|_  «mtl,nj 

where  the  N's  and  M's  make  reference  to  X-  and  Y-detector 
noises  respectively.  With  the  observation  and  covariance 
matrices  defined  in  Eqs.  (6-38)  through  (6-42) ,  Eqs.  (6-22) 
and  (6-23)  can  be  applied  directly  to  perform  the  joint 
estimation  of  the  vector  ((). 
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Thus,  the  product  W  ^[r(t)  -  s(t,^)]  in  Eq.  (6-22)  is 
a  (n[2n+l]+n)  column  matrix  with  general  terms 

Ky, 

for  1  <  <  m, 

1  <  8.  <  n+1,  (6-43) 

and 

M— 

for  1  <  f’"  <  m+1, 

1  ^  il  <  n  .  (6-44) 

The  signal  derivative  matrix  is  ( [m+l] [n+1] ) x (m [2n+l] +n) 
dimensional  given  by 


the  MAP  estimate  of  the  pq 
terms  to  be  given  by 


phase  is  found  in  general 
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Z  Z  — 
k:=1  £=1 
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ID 


~'^kZ 


m+l  n  ^  3x  ,  (t,^)  ) 

+  Z  Z  /  - i- - [w  (t)-x  (t,(|))]dt  .  (6-46) 

K  =  1  Jl=l^  KZ  -  ^ 

Finally,  when  Eqs.  (6-39)  and  (6-40)  are  used  in  Eq.  (6-46) , 
the  general  pq^^^^  member  of  the  ^  vector  of  Eq.  (6-41)  is 
obtained.  The  result  is  given  by 


m  n+1  b 


k:=1  £=1  ^k£  ^’^pq;K+l,£  "  ^pq,K£^ 

X  ^ 

//.  .  -  .  .  '^k+1,£ 

(t)cos  (wt  +  - '—2 - )dt 


m+l  n  C 


kS, 


^  ^  M  ^^ua'ic  £+1  ~  ^DQ  k£^ 

K=1  £=1  ^k£  Pq.</i!'+-l-  pq/KX, 


J  (t)  cos  (cot  +  I - ^)dt 


for  0  <  t  ^  T, 

1  <  p  <  m+l , 

1  i  q  <  n+1,  (6-47) 


Equation  (6-47)  is  the  joint  Maximum  A  Posteriori  phase 
estimator  using  measurements  of  two  orthogonal,  lateral 
shearing  interferometers  and  their  detector  arrays.  The 
phase  estimate  distribution  over  the  aperture  is  shown  in 
Figure  23. 


1.  i 


Fig  23.  Phase  Estimate  Distribution  Over  the  Aperture 

The  algorithm  given  by  Eq.  (6-47)  is  applicable  only 
when  the  phases  ^  being  measured  are  jointly  Gaussian  ran¬ 
dom  variables  and  the  a  priori  information  represented  by 

k_  is  available.  The  distinction  between  $  and  (p  in 
—a  ^pq  ^rs 

Eq.  (6-47)  is  made,  observing  the  structural  form  of  Eg. 

(6-46),  by  the  covariance  terms  k  . .  and  k  • •  only, 

pcj  rlj  rS/lJ 

where  i  and  j  take  on  all  values  from  1  to  m+1,  and  1  to 
n+1  respectively.  Therefore,  the  joint  phase  estimates  are 
weighted  accordingly  by  the  a  priori  information.  This 
will  be  further  explained  in  a  forthcoming  example. 

Solution  of  Eq.  (6-47)  is  to  be  obtained  recursively 
with  numerical  methods  on  a  digital  computer.  However,  to 
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illustrate  the  estimator,  Eq.  (6-47)  can  be  implemented  with 
a  heuristic  correlator-summer  of  considerable  complexity  as 
will  be  seen  in  the  following  example. 


Illustrative  Example.  The  simplest  example  to  illus¬ 
trate  a  hardware  implementation  of  Eq.  (6-47)  is  given  by 
the  joint  estimate  of  a  wavefront  at  four  locations  using 
two  (2x2)  detector  arrays.  The  arrangement  is  shovm  in 
Figure  24.  The  notation  is  so  chosen  for  simplicity. 


Detector  Y2  corresponds  to  and  phase  corresponds  to 

in  the  notation  of  Eq.  (6-47).  It  will  also  be  assumed 


that  the  noises  are  of  equal  strength  and  the  signals 
have  equal  amplitude  /2E^/T.  With  the  problem  so  defined, 
the  p  phase  estimate  is  given,  from  Eq.  (6-47)  ,  by 


<P 

P 


^  ) 


/"  3'”'^  1 

(kp^-kpj^)  /  Zj^(t)cos(wt  +  — 


T  >1 


r  ^"^2 

+  (kp^-kp2)  /  Z2(t)cos(ut  +  — ^)  dt 


T 

w  (t)cos(ijot  +  — 


r  ^4“^  I 

+  (^p4“^p3)  /  W2(t)cos(a)t  +  — ,  (6-48) 


where  any  is  set  as  the  zero  reference.  Equation  (6-48) 

can  be  implemented  with  a  correlator-sumsmer  as  shown  in 

4*  •  ~4<  ■ 

Figure  25,  where  cos  (cot  +  ^ )  is  denoted  by  ^  The 

z  13 

structure  of  Figure  25  is  a  feedback  system  in  which  the 
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-  X- Shear 

Fig  24. 

Phase-Detector 

Grid  for  Joint 

Processing  of  Four 

Detectors  to  Estimate  Four  Wavefront  Phases 


detector  outputs  are  weighted  by  the  covariance  terms  so 
that  each  phase  estimate  has  a  contribution  from  each  meas¬ 
urement.  Three  feedback  loops  from  the  estimate  outputs 
(^>21  4>3  and  4)^  are  returned  to  the  detector  inputs.  The 
estimate  0^^  does  not  provide  any  feedback  for  being  the 
starting  point  of  the  spatially  recursive  estimation.  The 
phase  output  has  been  arbitrarily  set  as  the  zero  refer¬ 
ence. 

To  better  understand  how  this  formidable  structure 
weighs  the  measurements,  or  better  yet,  what  Eq.  (6-48) 
really  does,  the  underlying  mechanism  is  shown  in  Figure  26. 
Here,  the  phase  points  have  been  replaced  by  the  covariance 
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Fig  25.  Correlator-Summer  to  Estimate  Four  Phases 


I 


terms  of  <1)^  with  respect  to  the  other  phases.  The  detector 
locations  have  been  denoted  by  their  measurements  z (t)  and 
w(t) .  Figure  26-a  shows  the  mechanism  involved  in  esti¬ 
mating  (t>2*  Each  phase  location  has  the  covariance  term 
involving  4)^  and  the  phase  at  each  point;  this  is  the  moan¬ 
ing  of  the  arrows.  The  operation  involves  taking  pairwise 
the  difference  between  the  covariance  terms  at  all  points 
and  multiplying  by  the  measurement  made  between  each  two 
points  (this  means  the  output  of  the  integrator) .  Thus, 
the  difference  between  and  k^^  is  multiplied  by  zero 

because  there  is  no  detector  in  between.  In  the  same  man¬ 


ner,  Figure  26-b  shows  the  same  mechanism  used  to  evaluate 
<t>2.  Here,  the  covariance  terms  are  ^31'  ^32  ^34* 

This  can  be  generalized  for  any  number  of  array  elements. 

From  this  example,  it  is  evident  that  the  algorithm 
indeed  provides  a  means  for  jointly  processing  spatially 
correlated  phase  measurements.  It  also  implies  a  simultane¬ 
ous  evaluation  of  all  phase  points  and  a  recursive  substi¬ 
tution.  The  difficulty  involved  in  solving  Eq.  (6-47) 
numerically  is  also  evident  due  to  the  redundancy  provided 
by  the  feedback  loops.  Therefore,  such  a  solution  goes 
beyond  the  scope  of  this  paper. 


Estimator  Performance 

The  performance  of  the  estimator  of  Eq.  (6-22)  is  more 


easily  described  in  terms  of  the  Cramer-Rao  bounds,  provided 
that  the  signal-to-noise  ratio  is  large  and  biases  are 


negligible.  Then,  the  lower  bounds  on  the  error  matrix  are 
the  diagonal  terms  of  the  matrix 

Sb  =  ISa'i 

adapted  from  a  more  general  case  given  by  Van  Trees  (Ref  13: 
454).  Simplification  to  the  point  of  Eq.  (6-49)  is  possible 
because  the  matrix 

D.(u,(}))  - 

— Q  — 

A 


T 

/ 


E[D(t,4.)w"^D'^(t,<,)  ]dt 


°a 


(6-50) 


is  constant  over  (0,T)  for  s(t,^)  defined  as  [a+bsin (wt+^) ] . 

To  find  the  MS  error  bounds,  the  matrix  operations  of 

Eqs.  (6-50)  and  (6-49)  must  be  performed.  Carrying  out 

these  matrix  multiplications  is  a  rather  cumbersome  tas)<;. 

After  some  worJc,  the  matrix  D^(u,(|))  is  found  to  have  a 

— «  — 

diagonal  band  form  with  dimensions  (m[2n+l] +n) x ( [m+1] [n+1] ) 
as  illustrated  in  Figure  27.  The  five  X's  in  each  column 
are  niven  by 


2 

'p-l,q  \^‘^p-l,q  ^'^pq  / 

_  /3x  ,  9x 

__2 _  f  ...PrSL--!  —P/H-l] 
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(6-51) 


(6-52) 
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The  product  D,(u,(+i)k^  is  a  ( [m+l]  [n+1]  )  square  matrix 

—a  —  — ^ 

general  term  at  location  pq,ij  is  given  by 


< 


for 

N»y-'  i<P<m,  lSq^n+1 

M,x:  1  <  p  <  m+1  ,  1  i  q  <  n 

1  <  i  i  m+1  ,  1  s  j  i  n+1  .  (6-56) 

Integration  of  the  matrix  with  general  term  given  by  Eq. 

(6-56)  gives  a  constant  matrix  whose  general  term  at 

location  pq,ij  is  given  by 
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^Vl,q  «p,q.i 

*  7  +  ^p.q- 

^  1%  Vl,q  V  «p,a- 


I  pq  Vl,q  "pq  "p,q-l)  P'3-i^ 

c^  h2 

-  k  pq  . 

"pq  Vl,q;ij  * 


(6-57) 
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The  matrix  D_  is  ([m+1] [n+1])  square  and  p,  q,  i,  and  j  are 
bounded  as  in  Eq.  (6-56) .  Further  evaluation  of  the  error 
matrix  is  to  be  done  numerically  to  yield  the  diagonal 
values  of  Rg  defined  by  Eq.  (6-49) .  This  evaluation  as 
well  as  a  numerical  solution  of  Eq.  (6-47)  are  not  included 
in  this  paper  due  to  the  difficulties  encountered  in  under¬ 
taking  that  task.  This,  however,  does  not  discourage  fur¬ 
ther  study  since  Eqs.  (6-47)  and  (6-57)  are  believed  to  be 


correct. 


VII  Conclusions  and  Recommendations 


Conclusions 

The  purpose  of  this  work  was  twofold:  (1)  perform  a 
discrete  phase  estimation  analysis  for  a  single  detector 
measurement,  and  (2)  perform  a  joint  phase  estimation 
analysis  for  multiple  detector  measurements.  This  work  was 
prompted  by  the  need  for  improved  processing  techniques 
directly  applicable  to  shearing  interferometry  and  wave- 
front  correction  systems. 

'  The  first  workfront  motivation  was  to  determine  in  a 
stochastic  sense  if  phase  estimation  algorithms  with  the 
simplicity  of  the  form  of 

<1.  =  tan"^  l|^]  ,  (7-1) 

from  Reference  16,  intended  for  use  with  low  level  signals, 
could  be  obtained  using  Maximum  A  Posteriori  and  Maximum 
Likelihood  estimation  theories,  and  the  conditions  for  which 
they  would  give  good  phase  estimates.  A  family  of  such 
algorithms  was  found  and  is  given  by  the  ML  estimator  Eq. 
(4-48) ,  which  is  a  specialized  result  of  the  MAP  Eq.  (4-36) , 
derived  under  the  white  Gaussian  noise  assumption.  A  simi¬ 
lar  result  is  also  given  by  Eq.  (5-8) ,  derived  under  the 
Poisson  shot  noise  assumption,  for  the  case  of  low  fringe 
visibility.  It  is  shown,  however,  without  empirical  veri¬ 
fication  (computer  simulation) ,  that  this  algorithm  form 
will  result  in  poor  estimates  under  those  modeling  conditions. 
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The  performance  analyses  carried  out  in  Chapters  IV 
and  V  (both  theory  and  simulation)  show  that  the  only  factor 
affecting  phase  estimation  performance  is  the  SNR  regardless 
of  the  noise  process  involved  and  the  technique  used.  In 
fact,  the  performance  of  both  MAP  and  ML  estimators  is 
asymptotically  given  by  the  reciprocal  SNR  with  an  appro¬ 
priate  scaling  factor  for  the  particular  algorithm  form 
used  (given  k) .  Emphasis  is  made  on  this  point  to  clear  up 
Wyant's  implication  that  Eq-  (7-1)  might  be  free  from  the 
SNR  restrictions. 

The  estimators  of  Eqs.  (4-39)  and  (4-48)  were  verified 
with  a  simple  computer  simulation  with  results  presented 
in  Figures  10-19.  The  similarity  of  performance  between 
the  MAP  and  ML  estimators  particularly  for  SNR  >  10  dB 
follows  directly  from  the  implications  of  Eqs.  (4-91)  and 
(4-95) ,  which  predict  the  same  asymptotic  performance  of  the 
MAP  and  ML  phase  estimators  for  large  SNR,  and  the  implica¬ 
tion  of  Eq.  (4-90) ,  which  tends  to  ignore  a  priori  informa¬ 
tion  as  the  noise  in  the  measurement  decreases. 

For  a  given  SNR,  there  exists  a  tradeoff  between  algo¬ 
rithm  simplicity  and  algorithm  performance  of  the  discrete 
estimators.  The  increased  structural  complexity  as  k 
increases  from  k  =  3  to  infinity  is  illustrated  with  a  few 
examples  in  Tables  I,  II,  IV  and  V.  This  complexity  is 
particularly  noticeable  in  the  Poisson  analysis  equations. 
The  return  of  using  the  more  complex  forms  is  an  improved 
performance  as  shown  in  Table  III  and  plotted  on  a  relative 
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basis  in  Figure  9.  The  difference  in  theoretical  perform¬ 
ance  between  the  worst  (k  =  3)  and  the  best  (k  -  <»)  possible 
estimator  forms  is  1.65  dB  according  to  the  definition  of 
Eq.  (4-76) ,  and  decreases  quite  rapidly  as  the  number  of 
counts  (k)  increases.  On  the  basis  of  this  performance- 
complexity  tradeoff,  the  photon  counting  technique  (k  <  °°) 
is  suboptimal  (but  acceptable)  with  respect  to  current- 
measurement  (k  =  “)  based  phase  estimation. 

The  second  motivation  of  this  work  was  the  analysis  of 
a  time-space  problem  intended  to  provide  a  joint  estimate 
of  the  phases  across  the  aperture  of  the  interferometer. 

Such  joint  processing  had  not  been  addressed  in  the  light 
of  MAP  theory,  where  the  fact  that  the  phases  were  spatially 
correlated  could  be  used  to  improve  performance.  The  algo¬ 
rithm  derived  is  given  by  Eq.  (6-47)  and  is  restricted  to 
the  assumption  of  a  Gaussian  phase  distribution.  The 
weighting  between  measurements  is  explicitly  shown  by  the 
covariance  terms  in  Eq.  (6-47) .  The  difficulty  with  this 
algorithm  is  the  mathematical  form  of  simultaneous  non¬ 
linear  integral  equations  for  which  a  solution  is  not 
readily  available.  The  performance  equations  were  carried 
out  up  to  the  point  of  numerical  evaluation,  which  is  not 
included  due  to  time  constraints  imposed  by  the  difficulties 
encountered  in  deriving  the  algorithms.  Thus,  a  direct 
comparison  to  other  estimators  such  as  the  one  derived  by 
Hudgin  (Ref  5)  is  not  possible  given  the  form  of  Eqs.  (6-47) 
and  (6-57). 
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Recommendations 

The  study  contained  in  Chapters  IV  through  VI  was 
performed  for  the  extreme  cases  of  detector  limited  and 
signal  limited  noise  statistics.  It  is  not  unreasonable  to 
think  that  the  intermediate  cases  are  also  encountered  in 
the  reception  of  optical  fields,  where  both  thermal  and 
shot  noise  occur  together.  These  noise  processes  are  inde¬ 
pendent  of  each  other,  and  the  density  of  the  observables 
is  then  a  convolution  of  Gaussian  and  Poisson  functions. 
Estimation  under  these  conditions  should  be  tried. 

The  random  phase  in  the  argument  of  the  sine  function 
represented  both  target  and  turbulence.  Going  beyond  the 
application  of  wavefront  correction  systems,  it  may  be 
desirable  to  distinguish  target  and  noise  induced  phases. 

This  is  estimation  in  the  presence  of  unwanted  parameters 
and  should  also  be  considered. 

The  basic  assumption  of  the  analyses  presented  in  this 
paper  was  the  time  invariance  of  the  phase  in  the  measurement 
interval.  Although  the  staircase  approximation  to  the  phase 
process  may  be  suitable  for  slowly  varying  fields,  it 
requires  an  update  of  the  covariance  matrix  in  each  inter¬ 
val.  A  procedure  should  be  tried  for  which  0 (t)  is  time- 
dependent  throughout  the  measurement,  thus  freeing  the  esti¬ 
mation  problem  from  the  requirement  of  short  observation 
intervals.  This  could  be  done  by  homodyning  the  detector 
signal  and  filtering  the  frequency  domain  components  with  a 
low  pass  filter.  The  output  signal  would  then  have  the  form 
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r(t)  =  B  ~  sinO(t)  +  —  n  (t)  (7-2) 

^  /2 

where  B  is  a  gain  factor  given  by  the  amplitude  of  the  mix¬ 
ing  signal,  and  i^|^(t)  is  filtered  white  Gaussian  noise. 

This  waveform  can  nov/  be  used  in  Eg.  (3-15)  to  obtain 

r.  =  sinO .  +  —  n.  .  (7-3) 

3  2  3/23 

The  signal  has  now  been  quantized  and  is  in  a  suitable  form 
for  recursive  parameter  estimation  (Ref  2:319). 

The  joint  processing  algorithms  of  Eqs.  (6-23)  and 
(6-47)  were  derived  using  the  assumption  of  independent 
detector  noise  processes.  Development  of  an  algorithm  to 
include  the  case  of  spatially  correlated  noise  should  also 
be  considered.  Finally,  the  algorithm  of  Eq.  (6-47)  needs 
to  be  worked  out  to  an  implementable  form,  and  tested 
through  a  performance  evolution  by  means  of  Monte  Carlo 
simulation. 
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seems  to  be  inferior  to  current  measuring,  but  the  error  variance  is  only  1.65 
dB  larger  in  the  worst  case,  where  three  photon  counts  are  performed.  An 
extension  of  the  single  detector  analy.sis  is  made,  using  only  the  Gaussian 
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a  priori  l<.nowlcdge  available  througli  a  ccvarianco  matrix.  An  algorithm  for 
processing  continuous  waveform  measurements  is  developed,  but  no  computer  sim¬ 
ulation  is  included  due  to  difficulties  encountered  in  solving  t.he  feedback 
system  equations. 
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